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SEQUENCES 



The present invention relates to novel proteins and nucleotide sequences encoding 
said peptides, pharmaceutical compositions containing or targeting said peptides or 
5 nucleotide sequences, assays utilising said peptides or nucleotide sequences and 
methods of detecting the presence or absence of said proteins and polynucleotide 
sequences encoding them. 



Eosinophil cationic protein (ECP) is a cationic toxin stored in the large specific 
10 granules of human eosinophilic leukocytes. It is believed to be a member of the 
ribonuclease family (Slifinan NR et al. J Immunol. (1986) 137, 2193-2917) now 
designated as RNASE 3. 



Background teaching on ECP has been presented by Victor A. McKusick in "Online 
15 Mendelain Inheritance in Man (OMEVTT, John Hopkins University, Baltimore, MD. 
MIM Number 131398 (last edited 16 September 1997) on 
www.ncbi.lm.nih.gov/Omim; 



"Eosinophil cationic protein, like eosinophil-derived neurotoxin, is localized to 

20 the granule matrix of the eosinophil. Both proteins possess neurotoxic, 

helminthotoxic, and ribonucleolytic activities. The cDNA sequences of the 
genes encoding these 2 proteins and the deduced amino acid sequences 
indicate that the genes belong to the ribonuclease gene superfamily. Hamann 
et al. (Genomics 7: 535-546, 1990) demonstrated remarkable similarities of 

25 eosinophil-denVed neurotoxin and eosinophil cationic protein, symbolized 

RNS2 and RNS3, respectively, in noncoding sequences, introns, and flanking 
regions, as well as the previously known coding regions. A single intron in the 
5-prime untranslated region and an intronless coding region appear to be 
features common to many members of the RNase gene superfamily. Both 

30 genes were mapped to 14q24-q31 by a combination of analysis of somatic 

cell hybrids and in situ hybridization (Hamann et al., 1990 supra.). Mastrianni 
et al. (Genomics 13: 240-242, 1992) confirmed the assignment of both RNS2 
and RNS3 to chromosome 14 by Southern analysis of DNAs from mouse- 
human cell hybrids. They commented on the fact that another eosinophil 

35 granule protein is encoded by a gene on chromosome 19 and yet another, 

eosinophil peroxidase, by a gene on chromosome 17." 



ECP has a number of interesting biological activities of which its cytotoxic activity 
may be the most conspicuous. However, it is also capable of affecting fibroblasts in 
40 their production of collagen and other proteoglycan molecules (Hernas J, Sarnstrand 
B> Lindroth P 5 Peterson CGP, Venge P, Malmstrom A. Eosinophil cationic protein 
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alters proteoglycan metabolism in human lung fibroblast cultures. Eur J. Cell Biol. 
1992;59:352-63), a finding that supports the notion of eosinophils actively 
participating in tissue repair processes and in fibrotic diseases. 

5 Some heterogenicity has been noted in the molecular weight of ECP and this is 
believed to be explained by difference in the pattern of glycosylation of the protein. 
The ribonuclease and anti-bacterial effects of ECP have been investigated on a 
genotypic level by Rosenberg HF (J Biol Chem (1995) 270, 7876-7881) who by 
comparing wild-type and mutant ribonuclease defective recombinant ECP has shown 
1 0 that the mutant ribonuclease defective form retained anti -bacterial activity. The two 
mutant forms were generated by single base pair conversions at positions 517 (C to G 
resulting in the amino acid conversion His 128 to Asp) and 248 (A to G resulting in the 
amino acid conversion Lys38 to Arg). 

15 It has now been observed that of the different ECPs described in the art, a specific 
class of ECP bears a genotypic variation which is believed to be phenotypically 
advantageous. The present invention relates to such ECP proteins, as will be defined 
hereinafter, together with nucleotide sequences encoding said proteins. 

20 The genotypic difference that characterises the proteins of the present invention, arises 
from a single nucleotide polymorphism at position 926 of the nucleotide sequence for 
ECP given as Genbank accession number XI 6545 (SEQ ED No. 1). This results in 
replacement of arginine residue at position 97 in the expressed ECP amino acid 
sequence by a threonine residue. 

25 

Thus the present invention relates to the use of a protein selected from; 
(a) an ECP, 
30 (b) a mutant of said ECP, and 
(c) a fragment of (a) or (b), 
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said protein lacking an arginine 97 of wild-type ECP or its equivalent, in the 
preparation of a medicament for the prevention and/or treatment of cancer. 

Thus, the ECP for use in the present invention may comprise a modification at amino 
acid residue 97 or its equivalent such that residue 97 or its equivalent is any other 
amino acid other than arginine. 

The term "wild-type ECP (WT-ECP)" is used to refer to the protein sequence 
expressed by the DNA sequence shown herein as SEQ ID No. 1, having an arginine 
residue at position 97 (Arg97) and shown herein as SEQ ID No. 2. All reference to 
ECP protein of SEQ ID No. 2 or as expressed by SEQ ID No. 1 refer to the sequence 
of mature protein i.e. excluding the signal peptide. The relevant regions of the 
polynucleotide/protein sequences are identified in the sequence listing below. 

The term "an ECP" as used herein in terms of the present invention, means an ECP 
protein wherein at least the amino acid residue corresponding to Arg97 of wild type 
ECP is modified. 

A "mutant of said (an) ECP" is used to mean any variant or homologue of the ECP of 
the present invention, such variants and homologues being defined below. 

The term "an ECP of the present invention' 5 is used to include options (a) , (b) and (c) 
given above. 

Any of the above terms followed by "encoding polynucleotide" means a 
polynucleotide sequence capable of encoding the said ECP. 

It has been observed that the presence of an ECP of the present invention in a 
mammal, or most preferably a human, provides certain advantages such as protection, 
either partial or preferably complete protection against cancers and cancer-related 
disorders and its absence indicates a predisposition to such disorders. Thus, the 
presence of such an ECP encoding polynucleotide in heterozygous presentation has 
been observed to provide partial protection, whereas, its presence homozygously has 
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been observed to provide complete protection. In particular, ECP has been found to 
be involved in cancers such as Hodgkin's disease. 

Hodgkin's disease (HD) is a lymphoma with a comparatively good prognosis, with a 
5-year survival over 80% (Landis SH, Murray T, Bolden S, Wingo PA. Cancer 
statistics, 1998. CA Cancer J.Clin. 1998;48(l):6-29). The tumour cells in HD, the 
Hodgkin and Reed-Stemberg cells (HRS), are surrounded by a large amount of 
bystander cells, e.g., eosinophils. The amount of eosinophils in HD tumours varies 
within a wide span (Enblad G, Sundstrom C, Glimelius B. Infiltration of eosinophils 
in Hodgkin's disease involved lymph nodes predicts prognosis. Hematol.Oncol. 
1 993; 1 1(4): 187-93; and Toth J, Dworak O, Sugar J. Eosinophil predominance in 
Hodgkin's disease. Z. Krebsforsch. Klin. Onkol. Cancer Res. Clin. Oncol. 
1977;89(1): 107-1 1). The association between infiltration of eosinophils in HD 
tumours and poor prognosis has recently been confirmed in a large study (Enblad et al 
-supra; and von Wasielewski R, Seth S, Franklin J, Fischer R, Hubner K, Hansmann 
ML et al. Tissue eosinophilia correlates strongly with poor prognosis in nodular 
sclerosing Hodgkin's disease, allowing for known prognostic factors. Blood 
2000;95(4):1207-13). High levels of ECP in serum of HD patients has also been 
found, probably originating from eosinophils infiltrating the tumours. It has recently 
been observed that patients with high S-ECP levels had higher ESR and tended to 
have a more advanced stage and more frequently bulky disease than patients with 
normal values. 

Genetic polymorphisms in the gene encoding for ECP have now been observed, as 
described above. Since there is a prognostic relationship between the eosinophil 
participation in Hodgkin's disease and ECP molecules participate in fibrotic processes 
through the affection of fibroblasts, a mutation in the ECP gene may have an impact 
on the course of the disease related to this activity. This could also have an impact on 
pathogenic fibrotic processes in non-cancer disorders such as pulmonary fibrosis and 
scleroderma. 

As discussed above, the invention is based upon the observation that a single base 
mutation giving rise to an amino acid replacement at position 97 of wild type ECP is 
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advantageous. For example, in a preferred embodiment the ECP of the invention 
comprises any amino acid other than arginine, preferably threonine or a biostere 
thereof, at position 97 in place of arginine. In a further preferred embodiment, 
(discussed in detail below) an ECP of the present invention may comprise the 
5 sequence C T Y, corresponding to residues 96-98 of wild-type ECP or a biostere 
thereof The term "biostere" is used as understood in the art, but for example, such 
biosteres will be of the formula C Xj Y, wherein Xi is any amino acid residue other 
than arginine. 

10 A further aspect of the present invention relates to assays for the identification of 
compounds capable of mimicking an ECP of use in the present invention or 
alternatively antagonising the cancer inducing effect of wild-type ECR This aspect 
further includes the compounds identified by such assays. 

Studies carried out in patients suffering from Hodgkin's disease have shown that wild- 
type ECP is highly expressed in sufferers and is associated with the nodular sclerosis 
type of disease. The relationship between presence of the wild-type ECP allele and 
Hodgkin's disease appears to be particularly marked in patient below the age of 40. 
Specifically, it has been found that wild-type ECP is related to the higher stages of 
Hodgkin's disease (i.e. to patients in which the disease is more widespread, affecting 
more than one lymph node and spreading to other organs of the body) and to 
secondary prognostic signs (such as erythrocyte sedimentation rate (ESR) and blood 
hemoglobin (B-Hg) levels). 

25 Thus, reducing cellular levels of wild-type ECP, or preventing expression of wild-type 
ECP altogether, may be used in the prevention of cancer or cancer-related diseases 
and, in particular, eosinophil-related cancers such as Hodgkin's disease. The present 
invention therefore also provides a method of reducing or preventing expression of 
ECP and/or of preventing or reducing ECP activity. In an alternative embodiment the 

30 present invention provides a method of inhibiting ECP activity, preferably within the 
cell. In particular, the present invention provides a method of inhibiting wild-type 
ECP RNase activity. Thus, the present invention also provides antagonists of ECP 
capable of reducing or inhibiting ECP expression and/or activity. As such, there is also 

5 
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provided assays for the identification of ECP antagonists. ECP antagonists may 
include for example the proteins or nucleic acids of the present invention insofar as 
they can be used as antagonists of wild-type ECP, in gene therapy or antisense therapy. 

5 A further aspect of this invention relates to a method for detecting individuals having 
a predisposition or susceptibility to certain disease states, in particular, cancer or 
cancer-related diseases, especially eosinophil-related cancers such as Hodgkin's 
disease. It is a further aspect of the invention to identify individuals having such a 
predisposition or susceptibility by identifying those individuals with an altered WT- 
1 0 ECP encoding polynucleotide. 

Accordingly, the invention provides a method of determining whether an individual is 
homozygous or heterozygous for a ECP encoding polynucleotide and a polymorphism 
thereof. The method comprises screening for an individual at risk of a condition or 
1 5 disease correlated with presence of the polymorphism. 

This aspect of the invention further extends to the use of the polynucleotides and 
polypeptides of the present invention in the treatment of such a patient with a mimetic 
of an ECP of the invention or an agent capable of inducing a genotypic modification 
20 to give rise to the expression of an ECP of the invention. 

The preferred embodiments of the present invention will be discussed below under the 
appropriate headings, 

25 Protein 

The term "protein" includes polypeptides having at least more than 5, 10 or 20 amino 
acids. 

30 The ECP proteins of use in the present invention all include replacement of arginine at 
position 97 of WT-ECP by an alternative amino acid residue. Preferably the 
replacement is effected by a threonine residue or biostere thereof at the position 
corresponding to Arg97 of WT-ECR Preferably, such proteins comprise the sequence 

6 
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C T Y or more preferably NCTYAor biosteres thereof at positions corresponding to 
amino acids 96-98 or 95-99 of WT-ECP respectively. The term biostere is used as 
understood in the art and encompasses modifications made to this sequence by (a) one 
or more amino acid residues being replaced by a naturally or non-naturally occurring 
5 amino acid residue (b) the order of two or more amino acid residues being reversed, 
(c) both (a) and (b) being present together and (d) a spacer group being present 
between any two amino acid residues, provided the resultant protein retains the 
activity of the parent protein. 

1 0 The remaining protein sequence may be identical to WT-ECP or a variant, homologue 
or derivative thereof, herein all defined as mutant ECPs. Such variant, homologue or 
derivative forms are discussed in detail below. 

A most preferred ECP of the present invention relates to that given as SEQ ID No. 4. 

15 

The following description of polypeptide homologues, variants and derivatives is to 
be read in conjunction with the ECPs of the invention described above and each 
preferred embodiment described. 

20 Polypeptide homologues 

It will be understood that protein sequences of the invention or for use in the invention 
are not limited to the particular sequences or fragments thereof or sequences obtained 
from the particular protein but also include homologous sequences obtained from any 
25 source, for example related viral/bacterial proteins, cellular homologues and synthetic 
peptides, as well as variants or derivatives thereof. 

Thus, the present invention covers variants, homologues or derivatives of protein 
sequences of the present invention, as well as variants, homologues or derivatives of 
30 the nucleotide sequence coding for the protein sequences of the present invention. 

In the context of the present invention, a homologous sequence is taken to include an 
amino acid sequence which is at least 60, 70, 80 or 90% identical, preferably at least 

7 
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95 or 98% identical at the amino acid level over at least 15 residues including the 
modified residue corresponding to arginine 97, preferably over 15 to 50 amino acids 
including said modified residue, even more preferably over the entire length of the 
protein sequence of the present invention including said modified residue. In 
5 particular, homology should typically be considered with respect to those regions of 
the sequence known to be essential for providing protection from cancer or cancer- 
related disorders, in particular, eosinophil-related cancers such as Hodgkin's disease, 
rather than non-essential neighbouring sequences. Although homology can also be 
considered in terms of similarity (i.e. amino acid residues having similar chemical 
10 properties/functions), in the context of the present invention it is preferred to express 
homology in terms of sequence identity. 

Homology comparisons can be conducted by eye, or more usually, with the aid of 
readily available sequence comparison programs. These commercially available 
1 5 computer programs can calculate % homology between two or more sequences. 

% homology may be calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each amino acid in one sequence directly 
compared with the corresponding amino acid in the other sequence, one residue at a 
20 time. This is called an "ungapped" alignment. Typically, such ungapped alignments 
are performed only over a relatively short number of residues (for example less than 
50 contiguous amino acids). 

Although this is a very simple and consistent method, it fails to take into consideration 
25 that, for example, in an otherwise identical pair of sequences, one insertion or deletion 
will cause the following amino acid residues to be put out of alignment, thus 
potentially resulting in a large reduction in % homology when a global alignment is 
performed. Consequently, most sequence comparison methods are designed to 
produce optimal alignments that take into consideration possible insertions and 
30 deletions without penalising unduly the overall homology score. This is achieved by 
inserting "gaps" in the sequence alignment to try to maximise local homology. 
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However, these more complex methods assign "gap penalties" to each gap that occurs 
in the alignment so that, for the same number of identical amino acids, a sequence 
alignment with as few gaps as possible - reflecting higher relatedness between the two 
compared sequences - will achieve a higher score than one with many gaps. "Affine 
5 gap costs" are typically used that charge a relatively high cost for the existence of a 
gap and a smaller penalty for each subsequent residue in the gap. This is the most 
commonly used gap scoring system. High gap penalties will of course produce 
optimised alignments with fewer gaps. Most alignment programs allow the gap 
penalties to be modified. However, it is preferred to use the default values when using 
10 such software for sequence comparisons. For example when using the GCG 
Wisconsin Bestfit package (see below) the default gap penalty for amino acid 
sequences is - 1 2 for a gap and -4 for each extension. 



Calculation of maximum % homology therefore firstly requires the production of an 
15 optimal alignment, taking into consideration gap penalties. A suitable computer 
program for carrying out such an alignment is the "GCG Wisconsin Bestfit package 
(University of Wisconsin, U.S.A.; Devereux et al, 1984, Nucleic Acids Research 
12:387). Examples of other software than can perform sequence comparisons include, 
but are not limited to, the BLAST package (see Ausubel et al, 1999 ibid - Chapter 
20 18), FASTA (Atschul et al, 1990, J. Mol. Biol., 403-410) and the GENEWORKS 
suite of comparison tools. Both BLAST and FASTA are available for offline and 
online searching (see Ausubel et aL, 1999 ibid, pages 7-58 to 7-60). However it is 
preferred to use the GCG Bestfit program. 



25 Although the final % homology can be measured in terms of identity, the alignment 
process itself is typically not based on an all-or-nothing pair comparison. Instead, a 
scaled similarity score matrix is generally used that assigns scores to each pairwise 
comparison based on chemical similarity or evolutionary distance. An example of 
such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 

30 BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom symbol comparison table if supplied (see user manual for 
further details). It is preferred to use the public default values for the GCG package, 
or in the case of other software, the default matrix, such as BLOSUM62. 

9 
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Once the software has produced an optimal alignment, it is possible to calculate % 
homology, preferably % sequence identity. The software typically does this as part of 
the sequence comparison and generates a numerical result. 

5 

Polypeptide Variants and Derivatives 

The terms "variant" or "derivative" in relation to the amino acid sequences of the 
present invention includes any substitution, variation, modification, replacement, 
1 0 deletion or addition of one (or more) amino acids from or to the sequence providing 
the resultant amino acid sequence retains the advantageous properties of the parent 
ECP protein as described above. 

An ECP of the invention may be modified for use in the present invention. Typically, 
15 modifications are made that maintain the protection providing property of the 
sequence. Amino acid substitutions may be made, for example from 1, 2 or 3 to 10, 
20 or 30 substitutions provided that the modified sequence retains such properties. 
Amino acid substitutions may include the use of non-naturally occurring analogues, 
for example to increase blood plasma half-life of a therapeutically administered 
20 polypeptide. 

Thus, homologous substitution (substitution and replacement are both used herein to 
mean the interchange of an existing amino acid residue with an alternative residue) 
may occur, i.e. like-for-like substitution such as basic for basic, acidic for acidic, polar 

25 for polar, etc. Non-homologous substitution may also occur, i.e. from one class of 
residue to another or alternatively involving the inclusion of unnatural amino acids 
such as ornithine (hereinafter referred to as Z), diaminobutyric acid (hereinafter 
referred to as B), norleucine (hereinafter referred to as O), pyriylalanine, 
thienylalanine, naphthylalanine and phenylglycine, a more detailed list of which 

30 appears below. 



Conservative substitutions may be made, for example according to the Table below. 
Amino acids in the same block in the second column and preferably in the same line 
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ALIPHATIC 


Non-polar 


GAP 






IL V 




Polar - uncharged 


C STM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




HFWY 



Such replacements may also be made by unnatural amino acids include; alpha* and 
5 alpha-disubstituted* amino acids, N-alkyl amino acids*, lactic acid*, halide 
derivatives of natural amino acids such as trifluorotyrosine*, p-Cl-phenylalanine*, p- 
Br-phenylalanine* , p-I-phenylalanine* 9 L-ally Ugly cine* , fl-alanine* , L-a-amino 
butyric acid*, L-y-amino butyric acid*, L-a-amino isobutyric acid*, L-s-amino 
caproic acid*, 7-amino heptanoic acid*, L-methionine sulfone**, L-norleucine*, L- 

10 norvaline*, p-nitro-L-phenylalanine*, L-hydroxyproline*, L-thioproline*, methyl 
derivatives of phenylalanine (Phe) such as 4-methyl-Phe*, pentamethyl-Phe*, L-Phe 
(4~amino) # , L-Tyr (methyl)*, L-Phe (4-isopropyl)*, L-Tic (1,2,3,4- 
tetrahydroisoquinoline-3-carboxyl acid)*, L-diaminopropionic acid # and L-Phe (4- 
benzyl)*. The notation * has been used for the purpose of the discussion above 

1 5 (relating to homologous or non-homologous substitution), to indicate the hydrophobic 
nature of the derivative, whereas # has been used to indicate the hydrophilic nature of 
the derivative, #* indicates amphipathic characteristics. 

Variant amino acid sequences may include suitable spacer groups that may be inserted 
20 between any two amino acid residues of the sequence including alkyl groups such as 
methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or 0- 
alanine residues. A further form of variation, involves the presence of one or more 
amino acid residues in peptoid form, will be well understood by those skilled in the 
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art. For the avoidance of doubt, "the peptoid form" is used to refer to variant amino 
acid residues wherein the cc-carbon substituent group is on the residue's nitrogen atom 
rather than the a-carbon. Processes for preparing peptides in the peptoid form are 
known in the art, for example Simon RJ et al., PNAS (1992) 89(20), 9367-9371 and 
5 Horwell DC, Trends Biotechnol. (1995) 13(4), 132-134. 

Proteins of the invention are typically made by recombinant means, for example as 
described below. However they may also be made by synthetic means using 
techniques well known to skilled persons such as solid phase synthesis, Proteins of 

1 0 the invention may also be produced as fusion proteins, for example to aid in extraction 
and purification. Examples of fusion protein partners include glutathione-S- 
transferase (GST), 6xHis, GAL4 (DNA binding and/or transcriptional activation 
domains) and P-galactosidase. It may also be convenient to include a proteolytic 
cleavage site between the fusion protein partner and the protein sequence of interest to 

15 allow removal of fusion protein sequences. Preferably the fusion protein will not 
hinder the function of the protein of interest sequence. 

Proteins of the invention may be in a substantially isolated form. It will be understood 
that the protein may be mixed with carriers or diluents which will not interfere with 
20 the intended purpose of the protein and still be regarded as substantially isolated. A 
protein of the invention may also be in a substantially purified form, in which case it 
will generally comprise the protein in a preparation in which more than 90%, e.g. 
95%, 98% or 99% of the protein in the preparation is a protein of the invention. 

25 Polynucleotides 

One aspect of the invention provides a polynucleotide capable of encoding any of the 
above defined ECP polypeptides, or a fragment thereof. In a preferred embodiment, 
the polynucleotide is an ECP encoding polynucleotide in which guanosine at position 
30 926 is replaced with cytosine. A fragment of such a polynucleotide comprises 
position 926 and is at least 1 5 nucleotides in length. Preferably, the polynucleotide is 
an isolated DNA molecule which means that it is free from other DNA molecules that 
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are naturally associated therewith in nature. Such a polynucleotide is preferably 
capable of expressing an ECP protein of the invention as hereinbefore described. 

As discussed above, the present invention is based upon the observation of a single 
5 base polymorphism at position 926 of a human ECP encoding gene. A further aspect 
of this observation is that the polymorphism is correlated with a predisposition to 
cancers and cancer-related disorders, in particular, eosinophil-related cancers such as 
Hodgkin's disease. The invention is of advantage in that by screening for the presence 
of the polymorphism it is possible to identify individuals likely to have this 
10 predisposition. 

Polynucleotides of the invention or for use in the invention comprise nucleic acid 
sequences encoding the polypeptide sequences of the invention. Such polynucleotides 
may be identified on the basis of a change in the Pst I restriction digest pattern of said 

15 sequence when compared to that of a WT-ECP encoding polynucleotide (WT-ECP 
DNA, for example SEQ ID No. 1). Thus, Pst I digestion of WT-ECP DNA gives rise 
to 2 sub-fragments by virtue of a Pst I site at bases 921 to 926. Polynucleotide 
sequences encoding an ECP of the present invention lack the corresponding Pst I site 
and a single fragment will be obtained. This difference may also be used in the 

20 methods of detecting the presence or absence of an ECP encoding sequence as 
discussed below. 

As the preferred proteins of the present invention include a threonine residue at the 
position corresponding to Arg97 in WT-ECP, or in more preferred embodiments the 

25 sequences CTYor NCTYA, the polynucleotide sequences encoding such proteins 
will at least include the nucleotide sequence ACG or in preferred embodiments (AAC) 
TGC ACG TAT (GCA) corresponding to bases (919) 922 to 930 (933) of WT-ECP 
DNA. This particular region may be varied within the confines of the definition of the 
term "or a biostere thereof 9 as discussed above. The remaining polynucleotide may be 

30 identical to WT-ECP DNA or a variant, homologue or derivative thereof as discussed 
in detail below. 
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A specific embodiment of the invention is the nucleotide sequence of SEQ ID NO: 3, 
listed in the sequence listing below. In SEQ ID NO: 3, the polymorphism lies in C in 
place of G at position 926 of SEQ ID NO: 1 (WT-ECP). 

5 It will be understood by a skilled person that numerous different polynucleotides can 
encode the same polypeptide as a result of the degeneracy of the genetic code. In 
addition, it is to be understood that skilled persons may, using routine techniques, 
make nucleotide substitutions that do not affect the polypeptide sequence encoded by 
the polynucleotides of the invention to reflect the codon usage of any particular host 
10 organism in which the polypeptides of the invention are to be expressed. 

Polynucleotides of the invention may comprise DNA or RNA. They may be single- 
stranded or double-stranded. They may also be polynucleotides which include within 
them synthetic or modified nucleotides. A number of different types of modification 
to oligonucleotides are known in the art. These include methylphosphonate and 
phosphorothioate backbones, addition of acridine or polylysine chains at the 3' and/or 
5' ends of the molecule. For the purposes of the present invention, it is to be 
understood that the polynucleotides described herein may be modified by any method 
available in the art. Such modifications may be carried out in order to enhance the in 
vivo activity or life span of polynucleotides of the invention. 

The terms "variant", "homologue" or "derivative" in relation to the nucleotide 
sequences include any substitution of, variation of, modification of, replacement of, 
deletion of or addition of one (or more) nucleic acid from or to the sequence providing 
25 the resultant nucleotide sequence codes for a polypeptide retaining the advantageous 
characteristics of the ECP proteins of the present invention, preferably having at least 
the same activity as sequences presented in the sequence listings. 

As indicated above, with respect to sequence homology, preferably there is at least 
30 75%, more preferably at least 85%, more preferably at least 90% homology to the 
sequences shown in the sequence listing herein. More preferably there is at least 95%, 
more preferably at least 98%, homology. Nucleotide homology comparisons may be 
conducted as described above. A preferred sequence comparison program is the GCG 
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Wisconsin Bestfit program described above. The default scoring matrix has a match 
value of 1 0 for each identical nucleotide and -9 for each mismatch. The default gap 
creation penalty is -50 and the default gap extension penalty is -3 for each nucleotide. 

5 The present invention also encompasses nucleotide sequences that are capable of 
hybridising selectively to the sequences presented herein, or any variant, fragment or 
derivative thereof, or to the complement of any of the above. Nucleotide sequences 
are preferably at least 15 nucleotides in length, more preferably at least 20, 30, 40 or 
50 nucleotides in length. 

10 

The term "hybridization" as used herein shall include "the process by which a strand 
of nucleic acid joins with a complementary strand through base pairing" as well as the 
process of amplification as carried out in polymerase chain reaction technologies. 

15 Polynucleotides of the invention capable of selectively hybridising to the nucleotide 
sequences presented herein, or to their complement, will generally be at least 70%, 
preferably at least 80 or 90% and more preferably at least 95% or 98% homologous to 
the corresponding nucleotide sequences presented herein over a region of at least 20, 
preferably at least 25 or 30, for instance at least 40, 60, 100 or more contiguous 

20 nucleotides, or over the entire length of the polynucleotide of the invention. Preferred 
polynucleotides of the invention will comprise regions homologous to nucleotides that 
include the Pst I site discussed above. 

The term "selectively hybridizable" means that the polynucleotide used as a probe is 
25 used under conditions where a target polynucleotide of the invention is found to 
hybridize to the probe at a level significantly above background. The background 
hybridization may occur because of other polynucleotides present, for example, in the 
cDNA or genomic DNA library being screening. In this event, background implies a 
level of signal generated by interaction between the probe and a non-specific DNA 
30 member of the library which is less than 10 fold, preferably less than 100 fold as 
intense as the specific interaction observed with the target DNA. The intensity of 
interaction may be measured, for example, by radiolabelling the probe, e.g. with P. 
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Hybridization conditions are based on the melting temperature (Tm) of the nucleic 
acid binding complex, as taught in Berger and Kimmel (1987, Guide to Molecular 
Cloning Techniques, Methods in Enzymology, Vol 152, Academic Press, San Diego 
CA), and confer a defined "stringency" as explained below. 

5 

Maximum stringency typically occurs at about Tm -5°C (5°C below the Tm of the 
probe); high stringency at about 5°C to 10°C below Tm; intermediate stringency at 
about 10°C to 20°C below Tm; and low stringency at about 20°C to 25°C below Tm. 
As will be understood by those of skill in the art, a maximum stringency hybridization 
10 can be used to identify or detect identical polynucleotide sequences while an 
intermediate (or low) stringency hybridization can be used to identify or detect similar 
or related polynucleotide sequences. 

In a preferred aspect, the present invention covers nucleotide sequences that can 
15 hybridise to the nucleotide sequence of the present invention under stringent 
conditions (e.g. 65°C and O.lxSSC {lxSSC = 0.15 M NaCl, 0.015 M Na 3 Citrate pH 
7.0}). 

Where the polynucleotide of the invention is double-stranded, both strands of the 
20 duplex, either individually or in combination, are encompassed by the present 
invention. Where the polynucleotide is single-stranded, it is to be understood that the 
complementary sequence of that polynucleotide is also included within the scope of 
the present invention. 



25 Polynucleotides which are not 100% homologous to the sequences of the present 
invention but fall within the scope of the invention can be obtained in a number of 
ways. Other variants of the sequences described herein may be obtained for example 
by probing DNA libraries made from a range of individuals, for example individuals 
from different populations. In addition, other viral/bacterial, or cellular homologues 

30 particularly cellular homologues found in mammalian cells (e.g. rat, mouse, bovine 
and primate cells), may be obtained and such homologues and fragments thereof in 
general will be capable of selectively hybridising to the sequences shown in the 
sequence listing herein. Such sequences may be obtained by probing cDNA libraries 
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made from or genomic DNA libraries from other animal species, and probing such 
libraries with probes comprising all or part of SEQ LD. Nos 1-6 under conditions of 
medium to high stringency. Similar considerations apply to obtaining species 
homologues and allelic variants of the polypeptide sequences of the invention. 

5 

Variants and strain/species homologues may also be obtained using degenerate PCR 
which will use primers designed to target sequences within the variants and 
homologues encoding conserved amino acid sequences within the sequences of the 
present invention. Conserved sequences can be predicted, for example, by aligning the 
10 amino acid sequences from several variants/homologues. Sequence alignments can be 
performed using computer software known in the art. For example the GCG 
Wisconsin PileUp program is widely used. 

The primers used in degenerate PCR will contain one or more degenerate positions 
15 and will be used at stringency conditions lower than those used for cloning sequences 
with single sequence primers against known sequences. 

Alternatively, such polynucleotides may be obtained by site directed mutagenesis of 
characterised sequences, such as SEQ ID. No 3. This may be useful where for 
20 example silent codon changes are required to sequences to optimise codon preferences 
for a particular host cell in which the polynucleotide sequences are being expressed. 
Other sequence changes may be desired in order to introduce restriction enzyme 
recognition sites, or to alter the property or function of the polypeptides encoded by 
the polynucleotides. 

25 

Polynucleotides of the invention may be used to produce a primer, e.g. a PCR primer, 
a primer for an alternative amplification reaction, a probe e.g. labelled with a 
revealing label by conventional means using radioactive or non-radioactive labels, or 
the polynucleotides may be cloned into vectors. Such primers, probes and other 
30 fragments will be at least 15, preferably at least 20, for example at least 25, 30 or 40 
nucleotides in length, and are also encompassed by the term polynucleotides of the 
invention as used herein. 
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Polynucleotides such as a DNA polynucleotides and probes according to the invention 
may be produced recombinantly, synthetically, or by any means available to those of 
skill in the art. They may also be cloned by standard techniques. An example of how 
ECP encoding polynucleotides, such as those of the present inveniton may be 
5 produced recombinantly is provided in Rosenberg H, J Biol Chem (1995) 270, 7876- 
7881. 

In general, primers will be produced by synthetic means, involving a step wise 
manufacture of the desired nucleic acid sequence one nucleotide at a time. 
1 0 Techniques for accomplishing this using automated techniques are readily available in 
the art. 

Longer polynucleotides will generally be produced using recombinant means, for 
example using a PGR (polymerase chain reaction) cloning techniques. This will 

15 involve making a pair of primers (e.g. of about 15 to 30 nucleotides) flanking a region 
of the lipid targeting sequence which it is desired to clone, bringing the primers into 
contact with mRNA or cDNA obtained from an animal or human cell, performing a 
polymerase chain reaction under conditions which bring about amplification of the 
desired region, isolating the amplified fragment (e.g. by purifying the reaction mixture 

20 on an agarose gel) and recovering the amplified DNA. The primers may be designed 
to contain suitable restriction eixzyme recognition sites so that the amplified DNA can 
be cloned into a suitable cloning vector. 

Nucleotide vectors 

25 

Polynucleotides of the invention can be incoiporated into a recombinant replicable 
vector. The vector may be used to replicate the nucleic acid in a compatible host cell. 
Thus in a further embodiment, the invention provides a method of making 
polynucleotides of the invention by introducing a polynucleotide of the invention into 
30 a replicable vector, introducing the vector into a compatible host cell, and growing the 
host cell under conditions which bring about replication of the vector. The vector may 
be recovered from the host cell. Suitable host cells include bacteria such as E. coli, 
yeast, mammalian cell lines and other eukaryotic cell lines, for example insect Sf9 
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cells. 

Preferably, a polynucleotide of the invention in a vector is operably linked to a control 
sequence that is capable of providing for the expression of the coding sequence by the 
5 host cell, i.e. the vector is an expression vector. The term "operably linked" means 
that the components described are in a relationship permitting them to function in 
their intended manner. A regulatory sequence "operably linked" to a coding sequence 
is ligated in such a way that expression of the coding sequence is achieved under 
condition compatible with the control sequences. 

10 

The control sequences may be modified, for example by the addition of further 
transcriptional regulatory elements to make the level of transcription directed by the 
control sequences more responsive to transcriptional modulators. 

15 Vectors of the invention may be transformed or transfected into a suitable host cell as 
described below to provide for expression of a protein of the invention. This process 
may comprise culturing a host cell transformed with an expression vector as described 
above under conditions to provide for expression by the vector of a coding sequence 
encoding the protein, and optionally recovering the expressed protein. 

20 

The vectors may be for example, plasmid or virus vectors provided with an origin of 
replication, optionally a promoter for the expression of the said polynucleotide and 
optionally a regulator of the promoter. The vectors may contain one or more 
selectable marker genes, for example an ampicillin resistance gene in the case of a 
25 bacterial plasmid or a neomycin resistance gene for a mammalian vector. Vectors may 
be used, for example, to transfect or transform a host cell. 

Control sequences operably linked to sequences encoding the protein of the invention 
include promoters/enhancers and other expression regulation signals. These control 
30 sequences may be selected to be compatible with the host cell for which the 
expression vector is designed to be used in. The term promoter is well-known in the 
art and encompasses nucleic acid regions ranging in size and complexity from 
minimal promoters to promoters including upstream elements and enhancers. 
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The promoter is typically selected from promoters which are functional in 
mammalian, cells, although prokaryotic promoters and promoters functional in other 
eukaryotic cells may be used. The promoter is typically derived from promoter 
5 sequences of viral or eukaryotic genes. For example, it may be a promoter derived 
from the genome of a cell in which expression is to occur. With respect to eukaryotic 
promoters, they may be promoters that function in a ubiquitous manner (such as 
promoters of a-actin, b-actin, tubulin) or, alternatively, a tissue-specific manner (such 
as promoters of the genes for pyruvate kinase). They may also be promoters that 
10 respond to specific stimuli, for example promoters that bind steroid hormone 
receptors. Viral promoters may also be used, for example the Moloney murine 
leukaemia virus long temiinal repeat (MMLV LTR) promoter, the rous sarcoma virus 
(RSV) LTR promoter or the human cytomegalovirus (CMV) IE promoter. 

15 It may also be advantageous for the promoters to be inducible so that the levels of 
expression of the heterologous gene can be regulated during the life-time of the cell. 
Inducible means that the levels of expression obtained using the promoter can be 
regulated. 

20 In addition, any of these promoters may be modified by the addition of further 
regulatory sequences, for example enhancer sequences. Chimeric promoters may also 
be used comprising sequence elements from two or more different promoters 
described above. 

25 Host cells 

Vectors and polynucleotides of the invention may be introduced into host cells for the 
purpose of replicating the vectors/polynucleotides and/or expressing the proteins of 
the invention encoded by the polynucleotides of the invention. Although the proteins 
30 of the invention may be produced using prokaryotic cells as host cells, it is preferred 
to use eukaryotic cells, for example yeast, insect or mammalian cells, in particular 
insect cells such as those including a polyhedrin promoter. 
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Vectors/polynucleotides of the invention may introduced into suitable host cells using 
a variety of techniques known in the art, such as transfection, transformation and 
electroporation. Where vectors/polynucleotides of the invention are to be 
administered to animals, several techniques are known in the art, for example 
5 infection with recombinant viral vectors such as retroviruses, herpes simplex viruses 
and adenoviruses, direct injection of nucleic acids and biolistic transformation. 

Protein Expression and Purification 

10 Host cells comprising polynucleotides of the invention may be used to express 
proteins of the invention. Host cells may be cultured under suitable conditions which 
allow expression of the proteins of the invention. Expression of the proteins of the 
invention may be constitutive such that they are continually produced, or inducible, 
requiring a stimulus to initiate expression. In the case of inducible expression, protein 

15 production can be initiated when required by, for example, addition of an inducer 
substance to the culture medium, for example dexamethasone or IPTG. 

Proteins of the invention can be extracted from host cells by a variety of techniques 
known in the art, including enzymatic, chemical and/or osmotic lysis and physical 
20 disruption. Rosenberg H (supra) describes how ECP proteins may be produced 
recombinantly. 

Administration 

25 Proteins of the invention and substances identified or identifiable by the assay 
methods of the invention may preferably be combined with various components to 
produce compositions of the invention. Preferably the compositions are combined 
with a pharmaceutically acceptable carrier or diluent to produce a pharmaceutical 
composition (which may be for human or animal use). Suitable carriers and diluents 

30 include isotonic saline solutions, for example phosphate-buffered saline. The 
composition of the invention may be administered by direct injection. The 
composition may be formulated for parenteral, intramuscular, intravenous, 
subcutaneous, intraocular or transdermal administration. Typically, each protein may 
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be administered at a dose of from 0.01 to 30 mg/kg body weight, preferably from 0.1 
to 10 mg/kg, more preferably from 0.1 to 1 mg/kg body weight. 

Polynucleotides/vectors encoding polypeptide components for use in affecting viral 
5 infections may be administered directly as a naked nucleic acid construct, preferably 
further comprising flanking sequences homologous to the host cell genome. When the 
polynucleotides/vectors are administered as a naked nucleic acid, the amount of 
nucleic acid administered may typically be in the range of from 1 fig to 10 mg, 
preferably from 1 00 \xg to 1 mg. 

10 

Uptake of naked nucleic acid constructs by mammalian cells is enhanced by several 
known transfection techniques for example those including the use of transfection 
agents. Example of these agents include cationic agents (for example calcium 
phosphate and DEAE-dextran) and lipofectants (for example lipofectam and 
15 transfectam™). Typically, nucleic acid constructs are mixed with the transfection 
agent to produce a composition. ■ 

Preferably the polynucleotide or vector of the invention is combined with a 
pharmaceutically acceptable carrier or diluent to produce a pharmaceutical 
20 composition. Suitable carriers and diluents include isotonic saline solutions, for 
example phosphate-buffered saline. The composition may be formulated for 
parenteral, intramuscular, intravenous, subcutaneous, intraocular or transdermal 
administration. 

25 The routes of administration and dosages described are intended only as a guide since 
a skilled practitioner will be able to determine readily the optimum route of 
administration and dosage for any particular patient and condition. 

Assays 

30 

The invention further includes assays, being methods of detecting the presence or 
absence of a polymorphism in an ECP-encoding polynucleotide. In this embodiment 
of the invention, the method of detection may employ a polymerase chain reaction, 
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single strand conformational polymorphism assay, or any such detection technique 
described below under the heading "Genotyping", and determining whether an 
individual possesses a wild type ECP encoding polynucleotide or a polymorphism 
thereof. Each individual may be homozygous for the wild type, heterozygous for the 
5 wild type and a polymorphism, or homozygous for polymorphisms in the ECP 
encoding polynucleotide. In this respect the term "polymorphism" is used to refer to 
that that distinguishes the ECP proteins/polynucleotides of the present invention from 
those described herein as wild-type. Presence of wild type ECP correlates with 
predisposition to cancer or cancer-related disorders, in particular, eosinophil-related 
10 cancers such as Hodgkin's disease. Optionally, the method further comprises use of 
an indicator means to react to the presence of the wild type allele. 

Indicator means typically induces a detectable signal upon presence of the 
polymorphism, and can induce a colour change or a coagulation or induce a restriction 
15 site, detectable by further analytical steps. Another indicator means comprises an 
antibody that has binding affinity that distinguishes between a wild type sequence and 
a polymorphism. 

A particular method of the invention comprises screening for a wild type allele in an 
20 ECP encoding polynucleotide by virtue of the presence of a Pst I restriction site at 
bases 921 to 926 of the ECP DNA sequence shown in SEQ ID No. 1, wherein 
presence of the wild type allele correlates with predisposition to cancer or cancer- 
related disorders, in particular, eosinophil-related cancers such as Hodgkin's disease. 
This method is discussed above in respect of the polynucleotides of the present 
25 invention. 

In use of a specific embodiment of the invention to be described below in further 
detail, an individual is screened to detemiine whether he or she possess a Pst I 
restriction site at bases 921 to 926 of an ECP encoding polynucleotide which is a 
30 published sequence or is a polymorphism thereof in which a guanosine nucleotide at 
position 926 has been replaced by a cytosine nucleotide. In this specific embodiment, 
the absence of the polymorphism in which guanosine is replaced by cytosine at 
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position 926 correlates with a predisposition to cancer or cancer-related disorders, in 
particular, eosinophil-related cancers such as Hodgkin's disease. 

Screening is carried out, for example, using PCR primers adapted to amplify a portion 
5 of an ECP encoding polynucleotide that includes the nucleotide at position 926. It is 
preferred that the PCR primers are selected so as to amplify a region of the 
polynucleotide that surrounds position 926 and includes at least six nucleotides on 
either side of this position. Such a pair of primers are shown herein as SEQ ID's Nos. 
5 and 6. These primers will give rise to a 644bp fragment of SEQ ID No. 1 as 

1 0 described in Example 1 . This fragment includes the region encoding the ECP protein. 
PCR techniques are well known in the art and it would be within the ambit of a person 
of ordinary skill in this art to identify primers for amplifying a suitable section of the 
ECP encoding polynucleotide that includes the nucleotide at position 926. PCR 
techniques are described for example in EP-A-0200362, EP-A-02011S4, and U.S. Pat. 

15 Nos. 4,683,195, 4,683,202 and 4, 965,188. The amplified products may then be 
subjected to Pst I digestions and the resultant fragments separated by, for example gel 
electrophoresis. The pattern of fragments produced by such Pst I digestion is 
indicative as to the presence of absence of an ECP encoding sequence of the present 
invention. Thus, two fragments indicates the presence of WT-ECP encoding DNA, a 

20 single fragment indicative of homozygous presentation of the polymorphism at 
position 926 (as discussed above) and three fragments indicative of heterozygous 
presentation of the polymorphism. 

In a further embodiment of the invention, the diagnostic method comprises analysis of 
25 the region surrounding position 926 of an ECP encoding polynucleotide using single 
strand conformational polymorphism (SSCP) mapping. It is preferred that the PCR 
primers for this purpose are selected so as to be homologous with a region of the 
genome within 200 bp of position 926 on the ECP encoding polynucleotide. It is 
further preferred that the PCR primers are selected so that position 926 is substantially 
30 towards the middle of the amplified DNA segment. 

The invention further provides a diagnostic kit comprising diagnostic means 
according to this aspect of the invention, optionally within a container. Thus, the 
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invention further provides a diagnostic kit comprising a carrier means such as a carton 
or box being compartmentalised to receive in close confinement therein the detection 
means according to the invention, optionally within a container means such as a vial, 
tube, ampoule, and the like. Further container means may also be present which 
5 comprise other elements of the method of detection as described herein. 

The detection means of the present invention may preferably comprise primers SEQ 
ID NOs: 5 and 6, the sequences of which are shown in the sequence listing below. The 
5' end of SEQ ID NO: 5 binds at position 495 of SEQ ID No. 1, the 3' end of SEQ ID 
10 NO: 6 binds at position 1114. Preparation of further primers suitable for determining 
genotype of a ECP encoding polynucleotide will be within the ambit of a person of 
ordinary skill in the art. 

Further aspects of the present invention relate to assays for compounds capable of 
1 5 acting as agonists of an ECP protein of the present invention. Such agonists may be 
administered to patients identified by a method of detection described above as having 
a predisposition to cancer or cancer-related disorders, in particular, eosinophil-related 
cancers such as Hodgkin's disease. While not wishing to be bound by theory, it is 
believed that the ECP proteins of the present invention provide their beneficial 
20 properties by not being cytotoxic and/or by not being fibroblast activating like wild- 
type ECP. Assays that are capable of measuring whether these proposed end-points, 
particularly fibroblast activation, may be of use in identifying agonists of the ECP 
proteins are described herein. The mutant ECPs described above are examples of 
such agonists. 

25 

End point assays of the present invention, used to identify mimics or agonists of the 
ECP proteins described herein, include but are not limited to: 

1 . Assays for detecting cytotoxic properties with respect to the killing of bacteria 
30 and/or cancer cells; 

FMCA procedure: A modification of the fluorometric microculture cytotoxic assay 
(FMCA) described by Larson R et al (1992) is used. An erythroleukemic K562 cell 
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line is cultured in RPMI 1640 (HyClone, Cramlington, UK) supplemented with 10% 
heat inactivated foetal calf serum (FCS) (HyClone, Cramlington, UK), penicillin 60 
[ig/mL and streptomycin 50 (ig/mL (HyClone, Cramlington, UK). By the day of assay, 
the cells are washed three times in RPMI 1640 supplemented with penicillin 60 
5 ng/mL and streptomycin 50 {ig/mL and without FCS. K562 cells, 20 000 cells/well, 
are seeded into wells of V-shaped 96-well microtiter plates (Nunc, Roskiide, 
Denmark) in triplicates, 10 fiL ECP in 0.2 M Na-acetate buffer pH 5.5 at a final 
concentration of 20 [ig/niL/well is added. Wells with cells and buffer serve as a 
negative control. As a positive control for cytotoxicity, cells treated with Triton at a 

10 final concentration of 0.01 % are employed. The culture plates are then incubated at 
+37 °C in humidified atmosphere containing 95% air and 5 % C0 2 for 72 hr, followed 
by centrifugation (200 x G, 7 min). After medium removal and one wash with PBS 
200 [iL/well, 100 [J,L/well of PBS containing fluorescein diacetate (FDA) (Sigma 
Chemical, Co, St. Louis, MO, USA) (10 |ig/mL) is added. Subsequently the plates are 

1 5 incubated for 1 hr at +37 °C before reading fluorescence with filters set at 485 and 538 
for excitation and emission, respectively (Fluorescan 2, Labsystems OY, Helsinki, 
Finland). The fluorometer is blanked against wells containing PBS including FDA dye 
but without cells. The fluorescence data is transferred to custom-made software for 
automated data calculation using Microsoft Excel and a Mcintosh SE/30 personal 

20 computer. The results obtained by the indicator FDA are presented as survival index 
(SI) defined as fluorescence in test wells in percent of control wells. Individual 
column fractions were tested for toxicity to K562 cells. Quality criteria for a 
successful assay include a fluorescence signal in control cultures of >5 x mean blank 
values and mean coefficient of variation in control wells of <20%. 

25 

DiSC procedure: In parallel, a modified form of the short-term dye-exclusion test, 
Differential Staining and Cytotoxicity (DiSC) assay (Weisenthal LM, Marsden JA, 
Dill PL, Macaluso CK. A Novel dye exclusion method for testing in vitro 
chemosensitivity of human tumors. Cancer Res 1983, 43:749-57 , Nygren P, 
30 Kristensen J, Jonsson B ? Sunstrom C, Lonnerholm G 5 Kreuger A, Larsson R. 
Feasability of the fluorometric microculture cytotoxocitity assay (FMCA) 
for cytotoxic drug sensitivity testing of tumor cells from patients with acute 
lymphoblastic leukemia. Leukemia 1992 6:1121-8) is performed. Immediately after 
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fluorescence measurement, selected wells are exposed to a mixture of Fast green, 1 %, 
Nigrosin, 0.5% (Sigma Chemical, Co, St Louis, MO, USA) and 25 000 formaldehyde- 
fixed chicken erythrocytes/well for 10 min at room temperature. The cellular content 
of the wells is subsequently cytocentrifuged onto slides using a Cytospin 3 (Shandon, 
5 Astmoore, UK) and counter stained with May-Gmnewald-Giemsa stain. Cell survival 
is evaluated by light microscopy. Viable cells stain normal Giemsa morphology, 
whereas dead cells and chicken erythrocytes stain greenish-black. Tumour cell 
survival using this modified DiSC procedure (Nygren et al 1992 - supra) is calculated 
by expressing the survival index (SI) as a ratio of viable K562 cells to fixed 
10 erythrocytes in experimental wells as a percentage of the ratio obtained in control 
wells. 

2. Assays for detecting RNase activity; 

1 5 Rapid RNase detection may be achieved using a cleavable fluorescent-labelled RNase 
substrate. 5\xl of 10X RnaseAlert Buffer (Rnase Alert Kit- Ambion and Integrated 
DNA Technologies, Inc.) is pipeted into tubes containing lyophilized Fluorescent 
Substrate. Up to 45|al of the solution to be tested is then added and the mixture is 
incubated for 30-60 minutes at 37°C. The Fluorescent Substrate is a modified RNA 

20 oligonucleotide that emits a green fluorescence if it is cleaved by RNase. The 
fluorescence is visually detected using short-wave UV illumination or measured in a 
fluoi*ometer. Solutions containing RNase activity will produce a green glow in the 
assay whereas solutions without RNase activity will not fluoresce. The amount of 
RNase activity will be directly proportional to the rate of fluorescence increase. 

25 Quantitative measurements can be obtained from a fluorometer. 

3. Assays for measuring the capacity of a compound to affect fibroblasts in their 
production of collagen and other proteoglycans; 

30 Cultures of human embryonic lung fibroblasts (HFL-1) are established according to 
Malmstrom (Malmstrom A, Fransson LA Biosynthesis of dermatan sulfate. I. 
Formation of L-iduronic acid residues. J Biol Chem 1975, 250:3419-25), cultivated in 
25 cm cell culture flasks, 24-well cell culture plates or 96-well microplates in 
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Dulbecco's modified Eagle's medium with 10% new-born calf serum (NCS) and 
grown to confluence. At confluence, the medium is changed to a sulfate poor medium 
(Dulbecco's special medium) containing 0.4% NCS, supplemented with 50 mg/ml 
ascorbic acid, 0.2 mM L-prolin and 4 mM glutamine. The cultures are then 
5 reconditioned for 3-4 hours. ECP is added to the cultures in different concentrations 
(0.1, 1, 10, 100 |ig/ml Dulbecco's special medium. 0.4 ml/well). Medium alone with 
0.4% NCS served as control. After 24 hours the radioactive precursor ([ 35 S]-sulphate, 
200 (-iCi/ml) is added for an additional 24 hours. The medium is then decanted and 
the remaining cell layer is washed with phosphate-saline buffer that also are added to 

1 0 the medium fractions. The cell layer is further extracted with 4 M guanidine chloride, 
0.05 M sodium acetate, pH 5.8 containing protease inhibitors (0.01 M EDTA, 0.005 M 
N-ethylmalemide) and 1% triton X-100 overnight. Both medium and cell extract (the 
latter after dilution with 20 volumes of 6 M urea, 0.05 M sodium acetate, pH 5.8 
containing the same protease inhibitors as above, 5 fig ovalbumin and 0.1% triton X- 

15 100) are subjected on columns (0.5 x 0.7 cm) of DE-52. All other experiments with 
ECP used a concentration of 1 0 |ig/ml. 

Proteoglycans are precipitated with Alcian blue (Bjornsson S Simultaneous 
preparation and quantitation of proteoglycans by precipitation with alcian blue. Anal 

20 Biochem 1993 210:282-91). The methods specificity is based on low pH and high salt 
concentration in combination with detergent. The Alcian blue precipitation is used for 
quantitation of proteoglycans and glycosaminoglycans. The proteoglycans are further 
analysed before and after digestion with electrophoresis (Bjornsson S Size-dependent 
separation of proteoglycans by electrophoresis in gels of pure agarose. Anal Biochem 

25 1993 210:292-8). The electrophoresis method used is a discontinuous buffer system. 
The cathode (over) buffer is 0.1 M Tris-Ac buffer and in the anode buffer the Tris-Ac 
is decreased to 0.01 M, pH 7.3. All samples are added to a sample gel of tris-glycine 
(TG) x 2, 2.5 % SDS, 0.1% agarose, and 15% glycerol. These samples are applied to 
a gel of 2% agarose. The gels are analysed and scanned on the Fuji bio image analyse 

30 system from Fuji Photo Film CO., Ltd. 



Culture medium combined with PBS washes, are precipitated by addition of ethanol to 
a final concentration of 67% at 40°C overnight. Protein is separated from free amino 
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acids by filtration through a 0,45|j,m pore filter (type HV) using a vacuum filtration 
unit. The supernatant is retained and the filter washed twice with ethanol (67%). The 
filter with adherent proteins is hydrolysed in hydrochloric acid (HC1, 6 M) at 110°C 
for 16 hours. Supernatants are evaporated to dryness and hydrolysed as above. 
5 Hydrolysates are mixed with charcoal (30 mg) and filtered (0.65 p,m, type DA) prior 
to chromatography. Hydroxyproline is isolated and measured by reveresed-phase- 
HPLC after derivatization with 7-cloro-4-nitorbenzo-2-oxa- 1 , 3-diazole (NBD-CI). 
Briefly, a 200 |al aliquot of the hydrolysates prepared as described above is buffered 
with potassium tetraborates (100 [il, 0.4 M) and reacted with 12 mM NBD-C1 in 

10 methanol (100 |al). Samples are protected from light with aluminium foil and 
incubated at 37° C for 20 minutes. The reaction is stopped by addition of hydrochloric 
acid (50 \xl, 1.5 M) and finally 150 |al sodium acetate (167 mM) in acetonitrile (26% 
V7V) is added. Samples are filtrated (pore size 0.22 |im, type GV, Millipore, UK) and 
a 100 \x\ aliquot is loaded onto the column. The Hydroxyproline content in each 

1 5 sample is determined by comparing peak areas of samples from the chromatogram to 
those generated from standard solutions, derivatized and separated under identical 
conditions. 

Hydroxyproline measured in the ethanol-insoluble fractions is taken as an index of 
20 procellagen production and the rate of procollagen synthesis is obtained from the 
combined values for ethanol-soluble and combined values for ethanol-soluble and 
ethanol —insoluble fractions. The DNA from the cell layer are extracted with 500(ixl BS 
over night and then sonicated 10-20 seconds INTENSITET. Bisbenzimidzole is added 
to samples and the fluorescence spectrophotometer (Perkin-EImer reader, LS-5B) 
25 (Labarca C, Paigen K A simple, rapid, and sensitive DNA assay procedure. Anal 
Biochem 1980 102:344-52). 

The above and other assays may also be used to identify antagonists of wild-type ECP. 
These may be identified as agents that are capable of binding to WT-ECP, inhibiting 
30 WT-ECP RNase activity, and/or binding to DNA or RNA encoding WT-ECP and 
hence preventing the induction of cancer and cancer-related disorders, in particular, 
eosinophil-related cancers. Examples of such agents include, but are not limited to, 
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the ECP mutation 97 of the present invention and anti-ECP antibodies such as 
described in Rosenberg H (supra). 

A more detailed discussion of suitable assay techniques is provided below. 

5 

GENOTYPING 

As used herein, the term "genotyping" means determining whether an ECP encoding 
polynucleotide includes a guanosine at position 926. The term "genotyping" is 
10 synonymous with terms such as "genetic testing", "genetic screening", "determining 
or identifying an allele or polymorphism", "molecular diagnostics" or any other 
similar phrase. 

Any method capable of distinguishing nucleotide differences in the appropriate 
15 sample DNA sequences may also be used. In fact, a number of known different 
methods are suitable for use in genotyping (that is, determining the genotype) for a an 
ECP encoding polynucleotide of the present invention. These methods include but are 
not limited to direct sequencing, PCR-RFLP, ARMS-PCR, Taqman™, Molecular 
beacons, hybridization to oligonucleotides on DNA chips and arrays, single nucleotide 
20 primer extension and oligo ligation assays. 

GENOTYPE SCREENING 

In one embodiment, the present invention provides a method for genotype screening 
25 of a nucleic acid comprising a an ECP encoding polynucleotide from an individual. 
The methods for genotype screening of a nucleic acid comprising an ECP encoding 
polynucleotide from an individual may require amplification of a nucleic acids from a 
target sample from that individual. 

3 0 TARGET SAMPLE 

The target samples of the present invention may be any target nucleic acid comprising 
a an ECP encoding polynucleotide from an individual being analysed. For assay of 
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such nucleic acids, virtually any biological sample (other than pure red blood cells) is 
suitable. For example, convenient target samples include but are not limited to whole 
blood, leukocytes, semen, saliva, tears, urine, faecal material, sweat, buccal, skin and 
hair. For assay of cDNA or rnRNA, the target sample is typically obtained from a cell 
5 or organ in which the target nucleic acid is expressed. 

GENOTYPING SNPS 

A number of different methods are suitable for use in determining the genotype for an 
10 SNR These methods include but are not limited to direct sequencing, PCR-RFLP, 
ARMS-PCR, Taqman™, Molecular beacons, hybridization to oligonucleotides on 
DNA chips and arrays, single nucleotide primer extension and oligo ligation assays. 
Any method capable of distinguishing single nucleotide differences in the appropriate 
DNA sequences may also be used. 

15 

AMPLIFICATION 

As used herein, the term "amplification means nucleic acid replication involving 
template specificity. The template specificity relates to a "target sample" or "target 

20 sequence" specificity. The target sequences are "targets" in the sense that they are 
sought to be sorted out from other nucleic acids. Consequently, amplification 
techniques have been designed primarily for sorting this out. Examples of 
amplification methods include but are not limited to polymerase chain reaction (PCR), 
polymerase chain reaction of specific alleles (PASA), ligase chain reaction (LCR), 

25 transcription amplification, self-sustained sequence replication and nucleic acid based 
sequence amplification (NASBA). 

TAQMAN™ 

30 Suitable means for determining genotype may be based on the Taqman™ technique. 
The Taqman™ technique is disclosed in the following US patents 4,683,202; 
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4,683,195 and 4,965,188. The use of uracil N-glycosylase which is included in 
Taqman™ allelic discrimination assays is disclosed in US patent 5,035,996. 

PGR 

5 

PCR techniques are well known in the art (see for example, EP-A-0200362 and EP-A- 
0201184 and US patent numbers 4 683 195 and 4 683 202). The process for 
amplifying the target sequence consists of introducing a large excess of two 
oligonucleotide primers to the DNA mixture containing the desired target sequence, 

10 followed by a precise sequence of thermal cycling in the presence of a DNA 
polymerase. With PCR, it is possible to amplify a single copy of a specific target 
sequence in, for example, genomic DNA to a level detectable by several different 
methodologies (such as hybridisation with a labelled probe, incoiporation of 
biotinylated primers followed by avidin-enzyme conjugate detection and incorporation 

15 of 32 P labelled deoxynucleotide triphosphates, such as dCTP or dATP, into the 
amplified sequence). Alternatively, it is possible to amplify different polymorphic sites 
(markers) with primers that are differentially labelled and thus can each be detected. 
One means of analysing multiple markers involves labelling each marker with a 
different fluorescent probe. The PCR products are then analysed on a fluorescence 

20 based automated sequencer. In addition to genomic DNA, any oligonucleotide 
sequence may be amplified with the appropriate set of primer molecules. In 
particular, the amplified segments created by the PCR process itself are, themselves, 
efficient templates for subsequent PCR amplifications. By way of example, PCR can 
also be used to identify primers for amplifying suitable sections of an ECP encoding 

25 polynucleotide in or from a human. 

PRIMERS 

The present invention also provides a series of useful primers. 

30 

As used herein, the term "primer" refers to a single-stranded oligonucleotide capable 
of acting as a point of initiation of template-directed DNA synthesis under appropriate 
conditions (i.e., in the presence of four different nucleoside triphosphates and an agent 
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for polymerization, such as, DNA or RNA polymerase or reverse transcriptase) in an 
appropriate buffer and at a suitable temperature. The appropriate length of a primer 
depends on the intended use of the primer but typically ranges from 15 to 30 
nucleotides. Short primer molecules generally require cooler temperatures to form 
5 sufficiently stable hybrid complexes with the template. A primer need not reflect the 
exact sequence of the template but must be sufficiently complementary to hybridize 
with a template. 

The term "primer site" refers to the area of the target DNA to which a primer 
10 hybridizes. 

The term "primer pair" means a set of primers including a 5 1 upstream primer that 
hybridizes with the 5' end of the DNA sequence to be amplified and a 3' downstream 
primer that hybridizes with the complement of the 3' end of the sequence to be 
15 amplified. 

The primers of the present invention may be DNA or RNA, and single-or double- 
stranded. Alternatively, the primers may be naturally occurring or synthetic, but are 
typically prepared by synthetic means. 

20 

PRIMER HYBRIDISATION CONDITIONS 

As used herein, the term "hybridisation" refers to the pairing of complementary 
nucleic acids. Hybridisation and the strength of hybridisation (ie the strength of 
25 association between the nucleic acids) is impacted by such factors as the degree of 
complementarity between nucleic acids, stringency of conditions involved, the melting 
temperature (Tm) of the formed hybrid and the G:C ratio within the nucleic acids. 

As used herein, the term "stringency" is used in reference to the conditions of 
30 temperature, ionic strength and the presence of other compounds such as organic 
solvents under which the nucleic acid hybridisations are conducted. 
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Hybridizations are typically performed under stringent conditions, for example, at a 
salt concentration of no more than 1M and a temperature of at least 25 °C. For 
example, conditions of 5X SSPE (750 mM NaCl, 50 mM NaPhosphate, 5 mM EDTA, 
pH 7.4) and a temperature of 25-30°C. are suitable for allele-specific primer 
5 hybridizations. 

ALLELE SPECIFIC PRIMERS 

An allele-specific primer hybridizes to a site on target DNA overlapping a 
10 polymorphism and only primes amplification of an allelic form to which the primer 
exhibits perfect complementarity (See Gibbs, Nucleic Acid Res. 17, 2427-2448 
(1989)). This primer may be used in conjunction with a second primer which 
hybridizes at a distal site. Amplification proceeds from the two primers leading to a 
detectable product signifying the particular allelic form is present. A control may be 
1 5 performed with a second pair of primers, one of which shows a single base mismatch 
at the polymorphic site and the other of which exhibits perfect complementarity to a 
distal site. The single-base mismatch prevents amplification and no detectable product 
is formed. The method works best when the mismatch is included in the 3'-most 
position of the oligonucleotide aligned with the polymorphism because this position is 
20 most destabilizing to elongation from the primer (see, for example WO 93/22456). 
Hybridisation probes capable of specific hybridisation to detect a single base 
mismatch may be designed according to methods known in the art and described in 
Maniatas et al Molecular Cloning: A Laboratory Manual, 2 nd Ed (1989) Cold Spring 
Harbour. 

25 

PCR PRIMERS 

Preferably the screening is carried out using PCR primers designed to amplify 
portions of the human an ECP encoding polynucleotide (gene) that include nucleotide 

30 775. 

Examples of such PCR primers are shown as SEQ ID Nos. 5 and 6. 
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DETECTION OF POLYMORPHISMS IN AMPLIFIED TARGET SEQUENCES 

The amplified nucleic acid sequences may be detected using procedures including but 
not limited to allele-specific probes, tiling arrays, direct sequencing, denaturing 
gradient gel electrophoresis and single-strand conformation polymorphism (SCCP) 
analysis. 

ALLELE-SPECIFIC PROBES 

Allele-specific probes can be designed that hybridize to a segment of target DNA from 
one individual but do not hybridize to the corresponding segment from another 
individual due to the presence of different polymorphic forms in the respective 
segments from the two individuals. 

As used herein, the term "probe" refers to an oligonucleotide (i.e. a sequence of 
nucleotides), whether occurring naturally as in a purified restriction digest or produced 
synthetically, which is capable of hybridizing to another oligonucleotide sequence of 
interest. Probes are useful in the detection, identification and isolation of particular 
gene sequences. The hybridization probes of the present invention are typically 
oligonucleotides capable of binding in a base-specific manner to a complementary 
strand of nucleic acid. 

The probes of the present invention may be labeled with any "reporter molecule" so 
that it is detectable in any detection system, including but not limited to enzyme (for 
example, ELISA, as well as enzyme based histochemical assays), fluorescent, 
radioactive and luminescent systems. The target sequence of interest (that is, the 
sequence to be detected) may also be labeled with a reporter molecule. The present 
invention is not limited to any particular detection system or label. 

The hybridization conditions chosen for the probes of the present invention are 
sufficiently stringent that there is a significant difference in hybridization intensity 
between alleles, and preferably an essentially binary response, whereby a probe 
hybridizes to only one of the alleles. The typical hybridization conditions are 
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stringent conditions as set out above for the allele specific primers of the present 
invention so that a one base pair mismatch may be determined. 

TILING ARRAYS 

5 

The polymorphisms of the present invention may also be identified by hybridization to 
nucleic acid arrays, some example of which are described in WO 95/11995. The term 
"tiling" generally means the synthesis of a defined set of oligonucleotide probes that is 
made up of a sequence complementary to the sequence to be analysed (the "target 
1 0 sequence"), as well as preselected variations of that sequence. The variations usually 
include substitution at one or more base positions with one or more nucleotides. 

DIRECT SEQUENCING 

1 5 The direct analysis of the sequence of polymorphisms of the present invention may be 
accomplished using either the dideoxy chain termination method or the Maxam 
Gilbert method (see Sambrook et aL, Molecular Cloning, A Laboratory Manual (2nd 
Ed., CSHP, New York 1989) or using, for example, Standard ABI sequencing 
technology using Big Dye Terminator cycle sequencing chemistry analyzed on an ABI 

20 Prism 377 DNA sequencer. Preferably., the polymorphism used in the assays of the 
present invention are identified by the presence or absence of the fragments generated 
by PstI restriction analysis of the identified sequences. 

DENATURING GRADIENT GEL ELECTROPHORESIS 

25 

Amplification products of the present invention, which are generated using PCR, may 
also be analyzed by the use of denaturing gradient gel electrophoresis. Different 
alleles may be identified based on the different sequence-dependent melting properties 
and electrophoretic migration of DNA in solution. Erlich, ed., PCR Technology, 
30 Principles and Applications for DNA Amplification, (W.H. Freeman and Co, New 
York, 1992), Chapter 7. 

SINGLE-STRAND CONFORMATION POLYMORPHISM (SCCP) ANALYSIS 
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Alleles of target sequences of the present invention may also be differentiated using 
single-strand conformation polymorphism (SCCP) analysis, which identifies base 
differences by alteration in electrophoretic migration of single stranded PCR products, 
as described in Orita et ah, Proc. Nat. Acad. Sci. 86, 2766-2770(1989). Amplified 
PCR products can be generated as described above, and heated or otherwise 
denatured, to form single stranded amplification products. Single-stranded nucleic 
acids may refold or form secondary structures which are partially dependent on the 
base sequence. The different electrophoretic mobilities of single-stranded 
amplification products may be related to base-sequence difference between alleles of 
target sequences. 

IDENTIFYING DIFFERENCES BETWEEN TEST AND CONTROL SEQUENCES 

These detection procedures for amplified nucleic acid sequences may be used to 
identify difference of one or more points of variation between a reference and test 
nucleic acid sequence or to compare different polymorphic forms of the ECP gene 
from two or more individuals. 

REFERENCE NUCLEIC ACID SEQUENCES 

As used herein the term "reference nucleic acid sequence" means a control nucleic 
acid sequence such as a control DNA sequence representing one or more individuals 
homozygous for each of the alleles being tested in that assay. By way of example, 
control DNA sequences may include but are not limited to: (i) a genomic DNA from 
homozygous individuals; (ii) a PCR product containing a relevant SNP amplified from 
homozygous individuals; or (iii) a DNA sequence containing a relevant SNP that has 
been cloned into a plasmid or other suitable vector. The control sample may also be an 
allelic ladder comprising a plurality of alleles from known set of alleles. There may 
be a plurality of control samples, each containing different alleles or sets of alleles. 
Other reference/control samples typically include diagrammatic representations, 
written representations, templates or any other means suitable for identifying the 
presence of a polymorphism in a PCR product or other fragment of nucleic acid. The 
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terms "reference nucleic acid sequence", reference samples and control samples are 
used interchangeable throughout the text. 

Therapeutic uses 

5 

An aspect of the invention provides a method of screening an individual for a 
predisposition to cancer or cancer-related disorders, in particular, eosinophil-related 
cancers such as Hodgkin's disease, and if a polynucleotidetic predisposition is 
identified, treating that individual to delay, reduce or prevent the onset of cancer. 

10 

In particular, the present invention provides a method of screening for a predisposition 

to or a method of treating cancer or cancer-related disorders such as: acidophil 

adenomas; acoustic neuromas; adenocarcinomas; adenolymphomas; adenomas; 

adrenal gland tumours; ameloblastomas; anaplastic carcinomas of the thyroid; 
1 5 angiofibromas; angiosarcomas; apudomas; argentaffinomas; arrhenoblastomas; ascites 

tumours; ascitic tumours; astroblastomas; astrocytomas; atrial myxomas; 

bacteriophytomas; basal cell carcinomas; bone cancers; brainstem gliomas; brain 

tumours; breast cancers; Burkitt's lymphoma; carcinomas; cerebellar astrocytomas; 

cervical cancers; cherry angiomas; cholangiocarcinomas; chondromas; 
20 choriocarcinoma; CNS leukaemias; colon cancers; common acute lymphoblastic 

leukaemias; craniopharyngiomas; cutaneous T-cell lymphomas; cystomas; cytomas; 

ductal carcinomas in situ; ductal papillomas; dysgemiinomas; encephalomas; 

endometrial carcinomas; endotheliomas; eosinophilic granulomas; ependymomas; 

epitheliomas; erythroleukaemias; Ewing's sarcoma; extra nodal lymphomas; 
25 fibroadenomas; fibrocystic diseases; fibromas; follicular cancers of the thyroid; 

gangliogliomas; gastrinomas; gliomas; gonadoblastomas; haemangioblastomas; 

haemocytomas; hairy cell leukaemias; hamartomas; hepatomas; histomas; Hodgkin's 

disease; hypernephromas; infiltrating ductal cell carcinomas; insulinomas; Kaposi 

sarcomas; kidney tumours; large cell lymphomas; lipomas; liver cancers; Lucke 
30 carcinomas; lymphadenomas; lymphangiomas; lymphocytic leukaemias; lymphocytic 

lymphomas; lymphocytomas; lymphoedemas; lymphomas; medulloblastomas; 

meningiomas; mesotheliomas; Morton's neuromas; myelomas; myomas; myxomas; 

nasopharyngeal carcinomas; nephroblastomas; neuromas; non-Hodgkin's lymphoma; 
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oligodendrogliomas; optic gliomas; osteomas; ovarian cancers; Paget's disease of the 
nipple; pancreatic cancers; papillomas; phaeochromocytomas; plasmacytomas; 
progonomas; prolactinomas; renal cell carcinomas; retinoblastomas; 
rhabdomyosarcomas; rhabdosarcomas; sarcomas; skin cancers; squamous cell 
5 carcinomas; strawberry haemangiomas; teratomas; testicular cancers; thymomas; 
veno-occlusive disease; vestibular schwannomas; and Wilm's tumour. In particular, 
the present invention provides a method of screening for a predisposition to or a 
method of treating eosinophil-related cancers such as Hodgkin's disease. 

10 In an embodiment of this aspect of the invention, the predisposition of an individual to 
cancer is assessed by determining whether that individual is homozygous for an ECP 
encoding polynucleotide in which nucleotide 926 is guanosine, is heterozygous for 
this polynucleotide and the polymorphism in which guanosine at position 926 is 
replaced by cytosine, or is homozygous for the polymorphism using methods of 

1 5 detection discussed above. 

Thus, an individual who is G/G homozygous at position 926, for the polymorphism is 
classified as being at highest risk. An individual being G/C heterozygous is classified 
as having moderate risk. An individual being C/C homozygous is classified as being 
20 in the lowest risk category. 

Optionally, the assessment of an individual's risk factor is calculated by reference both 
to the presence of a ECP encoding polynucleotide polymorphism and also to other 
known polynucleotidetic or physiological or dietary or other indications. The 
25 invention in this way provides further information on which measurement of an 
individual's risk can be based. 

General methodology reference 
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Although in general the techniques mentioned herein are well known in the art, 
reference may be made in particular to Sambrook et aL, Molecular Cloning, A 
Laboratory Manual (1989) and Ausubel et ah, Short Protocols in Molecular Biology 
(1999) 4 th Ed, John Wiley & Sons, Inc. 

5 

Brief Description of the Figures 

Figure 1 shows the pattern of Pst I digest fragments of samples containing ECP- 
encoding DNA indicative the presence/absence of sequences of the present invention. 

10 

Figure 2 shows Erythrocyte sedimentation rate (ESR) according to the arg97thr ECP 
genotype (wild-type, heterozygous and homozygous). 

Figure 3 shows the RNase activity of wild-type ECP and of ECP mutation 97 as 
1 5 determined by an RNase activity assay. 

EXAMPLES 

Example 1: Identification of SNP in the human ECP gene 

20 

Material and Methods 
Subjects 

5 mL EDTA blood was drawn from a mixed population of 70 individuals, medical 
25 students and laboratory employees after their written consent 

PNA Preparation 

200 juiL blood were used for DNA preparations as has been described in Kawasaki, 
1990, (PCR Protocols, A Guide to Methods and Protocols, Ed. MAInnis ppl46-152, 
30 Academic Press, San Diego) with minor modifications. The blood was mixed with 
500 iiL 10 mM Tris, 0.1 mM EDTA (pH 8.0) and centrifuged for 10 seconds at 
lSOOOg* and the supernatant was discarded. This procedure was repeated three times, 
until all red blood cells were lyzed. The cell pellet was resuspended in 100 juL 
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Proteinase K-buffer (50 mM KC1, 20 mM Tris-HCl (pH 8.3), 2.5 mM MgCl 2 , 0.5% 
Tween 20, 100 jag/mL Proteinase K) and incubated at 56°C for 2 hours. Subsequently 
the samples were heated to 95°C for 10 minutes to inactivate the proteases. DNA 
concentration and purity was measured at 260 and 280 nm in a SPECTRAmax™ 250 
5 Microplate Spectrophotometer System (Molecular Devices, USA). 

PCR 

-100 ng DNA was used in a 50 ]xL PCR reaction containing 1.0 U Taq DNA 
polymerase and buffer from Life Technologies (Gaithersburg, MD), 1.5 mM MgCh, 
0.2 jiM dNTP, and 20 pmol primers. All base positions refer to Genebank accession 
number X16545 (SEQ ID No. 1). The biotinylated 3' primer, 5 r -biotin- 
ggacagttgctgatacccagagtac-3 ' (SEQ ID No. 5), between positions 5'- 1138 to 3'- 1114 
and the 5' primer, 5'-gtgtgtcataaccgagaccggatag-3' (SEQ ID No. 6), between 
positions 5 '-495 to 3 '-5 19, amplified together a 644 bp fragment spanning 56 bp of 
the intron, the prepeptide (86 bp), the protein coding part (399+3) and 100 bp of the 3' 
UTR (untranslated region). The same PCR reactions were set up with the 5' primer 
biotinylated instead of the 3 'primer. The PCR reactions were subjected to PCR in an 
Idaho technology PCR machine (Idaho Falls, Idaho) with the following PCR profile 
30 cycles of 96°C for 30 seconds, 51°C for 30 seconds and 74 °C for 1 minute. This 
profile was followed by 5 minutes at 74 °C. 5 \xL of the PGR reactions were visualised 
on an 1 % agarose-gel. 

DNA sequencing and Analysis 

The remaining PCR-reactions were subjected to DNA sequencing using an 
25 Amersham-Pharmacia-Biotech ALF-express DNA sequencer. The 45 \xl biotinylated 
PCR fragments were bound to streptavidin coated combs according to the 
manufacturers instructions. The DNA strands were separated and the biotinylated 3' 
strands were subjected to Sanger-dideoxy-sequencing using Cy5 labelled sequencing 
primers, T7 DNA polymerase and other components of the Auto Load SPS kit 
30 (Amersham Pharamacia Biotech, Uppsala, Sweden). Sequencing from the 5' end was 
performed with the primer, 5 r -Cy5~tctgcttcttctgttggggcttatg-3\ binding to the DNA 
sequence coding for the pre-peptide (Pos. 588-612). Sequencing from the 3' end was 
performed on fragment biotinylated in the 5 '-end. The 3' sequence primer was 3'- 
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Cy5-gatcttggctatgattgaggagctt-3 ' located in the 3'UTR (position 5'- 1101 to 3'- 1077). 
The combs were placed in the wells of an acrylamide sequencing gel (ready-gel, AP- 
Biotech) to release the sequence products. 

5 The sequence gel was run for 700 minutes at 1500 V and subsequently the sequence 
raw-data was exported from the sequence-program to the evaluation program AlfWin 
(Amersham Pharmacia Biotech). The DNA sequences were analysed using the 
software DNASIS together with studies of the sequence peak-pattern in the analyse 
program AlfWin. 

10 

Approximately 460 bp was analysed by alignment of the sequences sequenced in 5 ' 
and 3' directions, containing the entire gene coding for the mature protein (399bp) and 
some sequence in the pre-peptide and the 3 'UTR. 

15 Endonuclease restriction digestion 

17 jxL non-biotinylated PCR reaction was incubated with either 10U Cla I or 10U Pst 
I in an appropriate digestion buffer (Life Technologies). The samples were digested 
over night and subsequently analysed on an 1.5% agarose gel containing ethidium 
bromide. 

20 

Results 

To ensure that the highly homologous DNA region of the EPX/EDN (RNASE2) gene 
had not been co-amplified, the 644 bp PGR fragment was subjected to the ECP gene 
specific Cla I endonuclease digestion. The digestion gave rise to complete digestion 
25 (and two bands of sizes 241 and 403) showing that only the region containing the ECP 
gene had been amplified. 

The sequence-analysis of the region on chromosome 14 containing the ECP gene of the 
70 subjects gave rise to a heterogenous result. Single base substitutions were 
30 discovered at position 775, 926 and 1054. The base pair substitution at position 775 
gave a shift of amino acid 45 from Arg to Cys (CGT -> TGT). The base substitution at 
position 926 gave a shift of amino acid 97 from Arg to Thr (AGG->ACG). The base 
substitution at 1054 was located in the 3'UTR. The base substitution at position 775 
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was present only in a heterozygous form,, while the base substitution at position 926 
was present as both heterozygous and homozygous forms. 

The variant located at position 926 was found to be located at a restriction 
5 endonuclease site specific for the enzyme Pst I (CTGCAG changed to CTGCAC). 
This base-change inhibits the DNA cleaving activity of the enzyme. Therefore the 
material screened by sequencing were RFLP-analysed by Pst I digestion. The 644 
fragment was cleaved into fragments of 213 and 431 bp as shown in Figure 1. The 
wild type form gave rise to complete cleavage of the fragment while the homozygous 
1 0 form did not cleave at all. The heterozygous form of the base substitution gave rise to 
an intermediate, both cleaved and uncleaved fragments (Figure 1). The result of the 
clevages of the 70 individuals was in accordance with the sequencing results. 

We have shown in this study that gene variants of ECP do exist. Thus, we found two 
15 major base changes, which both gave rise to changes in the amino acid sequence. The 
change of arginine to threonine at amino acid position 97 seemed to be very common 
with a prevalence of almost 50% in a population study. Thus, 53% of the subjects 
investigated had the wild type with arginine in the 97 position and 8% being 
homozygous with substitution of arginine with threonine. The remaining subjects 
20 were heterozygous with respect to the mutation. 

Example 2: Correlation of the ECP gene polymorphism to Hodgkin's disease 

Patients and clinical characteristics 

25 The analysis was made on 43 patients with Hodgkin's disease (mean age 33, range 3- 
79), 30 treated at the Department of Oncology and 13 mainly at the Children's 
Hospital, University Hospital, Uppsala, Sweden, between Jan 1986 and May 1998. 
The male/female ratio was 1:1. Frozen material from a diagnostic tumour biopsy from 
a lymph node was available in all patients. Histological classification was made 

30 according to the Rye-classification (Lukes RJ, Butler JJ. The pathology and 
nomenclature of Hodgkin's disease. Cancer Res. 1966;26(6): 1063-83). Before the 
analysis all cases were re-evaluated and classified according to the REAL- 
classification (Hams NL, Jaffe ES, Stein H, Banks PM, Chan JK, Cleary ML et al. A 
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revised European-American classification of lymphoid neoplasms: a proposal from 
the International Lymphoma Study Group. Blood 1994;84(5): 1361-92). 

The clinical information was obtained from the clinical records. Clinical and 
5 pathological stagings were made according to the Ann Arbor system (Carbone PP, 
Kaplan HS, Musshoff K, Smithers DW, Tubiana M. Report of the Committee on 
Hodgkiris Disease Staging Classification. Cancer Res. 1971;31(11):186(M). Bulky 
disease was defined as a tumour diameter exceeding 1 0 cm or a mediastinal tumour 
equal to or more than 1/3 of the thorax diameter at the Th. 5-6 level. Complete 
1 0 remission (CR) was defined as disappearance of all known disease. The distributions 
of stage and histology are seen in Table 1 . 



Stage and 
Histology: 


Number of 
patients: 


Number 
of wild type 


Number of 
heterozygous 


Number of 
homozygous 


p-value 


Total 


43 


25 


16 


2 




Stage 












MA 


26 


13 


11 


2 


0.2 


HB-IV 


17 


12 


5 


0 




Histology 












All Non-NS 


23 


10 


12 


1 


<0.05 


NS 


20 


15 


4 


1 




MC 












LD 


0 


0 


0 


0 




LPn 












IPS 












<3 factors 


35 


20 


13 


2 




>3 factors 


8 


5 


3 


0 





The international prognostic score (IPS) was used as described by Hasenclever and 
15 Diehl (1998 A prognostic score for advanced Hodgkin's disease. International 
Prognostic Factors Project on Advanced Hodgkin's Disease. N. Engl. J. Med.; 
339(21): 1506-14). Parameters in the international prognostic score are serum albumin 
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level (<4g/dl), blood haemoglobin level (<10.5g/di)., sex (male), age (>45 years), stage 
(IV), leukocytosis (WBC>15 5 000/mm 3 ) ? and lymphocytopenia (<600/mm 3 or <8% of 
WBC). In 29 of the patients either lymphocyte count and/or S-albumin was missing. 
All parameters were available in 14 of the patients, 6 were available in 15, and 5 in 14 
5 of the patients. However, all patients were included in the analysis. 

The patients treated at the Department of Oncology were treated according to the 
recommendations in the Swedish care programme for HD (Glimelius B, Enblad G, 
Kalkner M, Gustavsson A, Jakobsson M, Branehog I et al. Treatment of Hodgkin's 
10 disease: the Swedish National Care Programme experience. Leuk.Lymphoma 
1996;21(l~2):71-8). The patients at the Children's Hospital were treated according to 
the principles for adults before 1996, and according to the recommendations in the 
GPOH-HD study protocol since 1996 (Gesellschaft fur Padiatrische Onkologie und 
Hamatologie, 1995). All patients but 3 were in CR after the treatment. 

15 

General population study subjects 

DNA was prepared from EDTA-blood of 70 apparently healthy medical students and 
laboratory employees after their written consent 

20 DNA Preparation 

200 \iL blood were used for DNA preparations as has been described in "PCR Primer: 
A Laboratory Manual," (Eds Dieffenbach CW & Dveksler GS, Cold Spring Harbour 
Press, New York 1995) with minor modifications. The blood was mixed with 500 |xL 
10 mM Tris, 0.1 mM EDTA (pH 8.0) and centrifuged for 10 seconds at 15 OOOg and 

25 the supernatant was discarded. This procedure was repeated three times, until all red 
blood cells were lyzed. The cell pellet was resuspended in 100 \xL Proteinase K-buffer 
(50 mM KC1, 20 mM Tris-HCl, pH 8.3), 2.5 mM Tween 20, 100 jig/mL Proteinase K) 
and incubated at 56°C for 2 hours. Subsequently the samples were heated to 95°C for 
10 minutes to inactivate the proteases. DNA concentration and purity was measured at 

30 260 and 280 nm in a SPECTRAmax™ 250 Microplate Spectrophotometer System 
(Molecular Devices, USA). 
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PGR 

100 ng DNA was used in a 50 |uL PCR reaction containing 0.5 U Taq polymerase and 
buffer from Life Technologies (Gaithersburg, MD), 1.5 niM MgCl, 2nM dNTP, and 20 
pmol primers. All base positions refer to Genebank accession number XI 6545 (SEQ 
5 ID No. 1). The biotinylated 3 ' primer, 5'-biotin-ggacagttgctgatacccagagtac-3' (SEQ ID 
No. 5), between positions 5'-1138 to 3'-l 1 14 and the 5' primer, 5'- 
gtgtgtcataaccgagaccggatag-3 ' (SEQ ID No. 6) between positions 5 '-495 to 3 '-5 19, 
amplified together a 644 bp fragment spanning 56 bp of the intron, the prepeptide (86 
bp), the protein coding part (399+3) and 100 bp of the 3' UTR (untranslated region). 
1 0 The same PCR reactions were set up with the 5 'primer biotinylated instead of the 
3 primer. The PCR reactions were subjected to PCR in an Idaho technology PCR 
machine (Idaho Falls, Idaho) with the following PCR profile 30 cycles of 96°C for 30 
seconds, 51°C for 30 seconds and 74°C for 1 minute. This profile was followed by 5 
minutes at 74°C. 5 pJL of the PCR reactions were visualised on a 1 % agarose-gel. 

15 

DNA sequencing and Analysis 

The remaining PCR-reactions were subjected to DNA sequencing using an 
Amersham-Pharmacia-Biotech ALF-express DNA sequencer. The 45 \\L biotinylated 
PCR fragments were bound to streptavidin-coated combs according to the 

20 manufacturers instructions. The DNA strands were separated and the biotinylated 3' 
strands were subjected to Sanger-dideoxy-sequencing using Cy5 labelled sequencing 
primers, T7 DNA polymerase and other components of the Auto Load SPS kit 
(Amersham Pharamacia Biotech, Uppsala, Sweden). Sequencing from the 5 'end was 
performed with the primer, 5'-Cy5-tctgcttcttctgttggggcttatg-3', binding to the DNA 

25 sequence coding for the pre-peptide (Pos. 588-612). Sequencing from the 3 'end was 
performed on fragment biotinylated in the 5 '-end. The 3 'sequence primer was 3'-Cy5- 
gatcttggctatgattgaggagctt-3 ' located in the 3 'UTR (position 5M101 to 3'- 1077). The 
combs were placed in the wells of an acrylamide sequencing gel (ready-gel, AP- 
Biotech) to release the sequence products. 

30 

The sequence gel was run for 700 minutes at 1500 V and subsequently the sequence 
raw-data was exported from the sequence-program to the evaluation program AlfWin 
(Amersham Pharmacia Biotech). The DNA sequences were analysed using the 
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software DNASIS together with studies of the sequence peak-pattern in the analyse 
program AlfWin. 

Approximately 460 bp were analysed by alignment of the sequences sequenced in 5 ' 
5 and 3 ' directions, containing the entire gene coding for the mature protein (399bp) and 
some sequence in the pre-peptide and the 3 'UTR. 

Endonuclease restriction digestion 

17 \\L non-biotinylated PCR reaction was incubated with 10U Pst I in an appropriate 
10 digestion buffer (Life Technologies). The samples were digested over night and 
subsequently analysed on a 1 .5% agarose gel containing ethidium bromide. 

To verify that the highly homologous DNA region of the EPX/EDN (RNASE2) gene 
had not been co-amplified, the 644 bp PCR fragment was subjected to the ECP gene 

1 5 specific Cla I endonuclease digestion (Hamann KJ ? Ten RM, Loegering D A, Jenkins 
RB, Heise MT, Schad CR et al. Structure and chromosome localization of the human 
eosinophil- derived neurotoxin and eosinophil cationic protein genes: Evidence for 
intronless coding sequences in the ribonuclease gene superfamily. Genomics 
1990;7:535-46). The digestion gave rise to complete digestion (and two bands of sizes 

20 241 and 403) showing that only the region containing the ECP gene had been 
amplified. 

Statistical methods 

To compare differences in frequencies between groups the Fisher's exact test was 
25 used. The Mann Whitney U Test was used as a non-parametric test. Histology was 
included in the analysis as NS versus non-NS. Survival and relapse-free survival in 
different groups were analysed using the log-rank test. IPS was analysed as 2 groups 
(0-2 factors and 3 or more factors). The statistics were performed with the Statistica 
5.5 software, Statsoft, Tulsa, USA. 

30 
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Results 

Identification of polymorphisms in the ECP gene 

The sequence-analysis of the region on chromosome 14 containing the ECP gene of 
the 70 apparently healthy subjects gave rise to a heterogeneous result. Single base 
5 substitutions were discovered at position 775, 926 and 1054 (according to SEQ ID No. 
1, Hamann et al - supra). The base pair substitution at position 775 gave a shift of 
amino acid 45 from Arg to Cys (CGT -> TGT). The base substitution at position 926 
gave a shift of amino acid 97 from Arg to Thr (AGG->ACG). The base substitution at 
1054 was located in the 3TJTR. The base substitution at position 775 was present only 
10 in a heterozygous form, while the base substitution at position 926 was present as both 
heterozygous and homozygous forms. 

The variant located at position 926 was located at a restriction endonuclease site 
specific for the enzyme Pstl (CTGCAG changed to CTGCAC). This base-change 

15 inhibits the DNA cleaving activity of the enzyme. Therefore the material screened by 
sequencing were RFLP-analysed by Pstl digestion. The 644 fragment was cleaved into 
fragments of 213 and 431 bp as shown in Figure 1. The wild type form gave rise to 
complete cleavage of the fragment while the homozygous form did not cleave at all. 
The heterozygous form of the base substitution gave rise to an intermediate, both 

20 cleaved and uncleaved fragments (Figure 2). The result of the cleavages of the 70 
individuals was in accordance with the sequencing results. 



Two major base changes give rise to changes in the amino acid sequence. The change 
of arginine to threonine at amino acid position 97 is very common with a prevalence 
25 of almost 50% in a population study. Thus, 53% of the subjects investigated had the 
wild type with arginine in the 97 position and 8% being homozygous with substitution 
of arginine with threonine. The remaining subjects (39%) were heterozygous with 
respect to the mutation. Only two subjects, heterozygous with respect to the 
substitution of arginine at position 45, were found. 

30 

Polymorphism of the ECP gene in Hodgkin 's disease 

Forty-three patients were analysed with respect to the presence of the arg97thr 
mutation. Twenty- five (58%) were wild type, 16 (37%) heterozygous and 2 (5%) 
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homozygous. There were no statistically significant differences between the 
distribution of the different genotypes according to sex or presence of bulky disease. 
This distribution is similar to that in a general population. The distribution of 
genotypes according to stage, histological subtype, and international prognostic 
5 indices are shown in Table 1 . There was no relation between the arg97thr genotype 
and stage of the disease, whereas the wild type was significantly more prevalent 
among patients with nodular sclerosis than among non-nodular sclerosis patients (75 
vs. 43 % 9 p<0.05). According to the international prognostic score (Hasenclever D, 
Diehl V. A prognostic score for advanced Hodgkin's disease. International Prognostic 

10 Factors Project on Advanced Hodgkin's Disease. N.EnglJ.Med. 1998;339(21).T506- 
14) 28 of the patients had less than 3 risk factors and 7 had 3 or more, but with no 
relation to the arg97thr genotype. Erythrocyte sedimentation rate (ESR) according to 
genotype is seen in Figure 2 and shows a significant relationship to the arg97thr 
genotype (p=0.009), with higher ESR in patients with wild type. Also blood 

15 haemoglobin and serum albumin tended to be lower in wild type, p=0.09 and 0.1, 
respectively. 

This shows that the ECP gene contains several different mutations, two of which give 
rise to an amino acid substitution in the ECP protein. The most common seems to be 

20 the substitution of an arginine at position 97 to a threonine in the amino acid sequence 
of the ECP molecule. In fact, almost half the population has this substitution either as 
a heterozygous variant or as a homozygous variant. This mutation was easily 
diagnosed based on the digestion of the wild type DNA sequence with the restriction 
enzyme Pstl and confirmed by means of regular DNA sequencing. The polymorphism 

25 is disease related, and results show a relationship to type and prognosis in a group of 
patients with Hodgkin's disease. 



The degree of eosinophilic infiltration in Hodgkin's disease has been shown to 
coirelate to the histological subtype with more eosinophils in NS (Enblad G s 
30 Sundstrom C, Glimelius B. Infiltration of eosinophils in Hodgkin's disease involved 
lymph nodes predicts prognosis. Hematol. Oncol. 1993;11(4):1 87-93; Toth J ? Dworak 
O, Sugar J. Eosinophil predominance in Hodgkin's disease. Z. Krebsforsch. Klin. 
Onkol. Cancer Res. Clin. Oncol. 1977;89(1): 107-11; and von Wasielewski R ? Seth S, 
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Franklin J, Fischer R, Hubner K, Hansmann ML et al. Tissue eosinophilia correlates 
strongly with poor prognosis in nodular sclerosing Hodgkin's disease, allowing for 
known prognostic factors. Blood 2000;95(4): 1207-1 3). The secretion of Transforming 
Growth Factor p (TGF(3) from eosinophils may explain this connection, as TGFp 
5 stimulates fibroblasts (Newcom SR, Tagra KK, Kadin ME. Neutralizing antibodies 
against transforming growth factor beta potentiate the proliferation of Ki-1 positive 
lymphoma cells. Further evidence for negative autocrine regulation by transforming 
growth factor beta. AmJ.PathoL 1992;140(3):709-18; and Kadin M, Butmarc J, 
Elovic A, Wong D. Eosinophils are the major source of transforming growth factor-b 

10 1 in nodular sclerosing Hodgkin's disease. AmJ.PathoL 1993;142 No.l:ll-6). High 
levels of heavy molecular weight TGF(3 have been detected in the urine from patients 
with NS (Newcom SR, Tagra KK. High molecular weight transforming growth factor 
beta is excreted in the urine in active nodular sclerosing Hodgkin's disease. Cancer 
Res. 1992;52(24):6768-73), and a significant amount of the TGFp in patients with NS 

15 originates from eosinophils (Kadin et al - supra). ECP inhibits proteoglycan 
degradation in fibroblasts, and this indicates a role for ECP in fibrosis (Hernas J, 
Sarnstrand B, Lindroth P, Peterson CGP, Venge P, Malmstrom A. Eosinophil cationic 
protein alters proteoglycan metabolism in human lung fibroblast cultures. EurJ.Cell 
Biol. 1992;59:352-63). In spite of the fact that NS in general is associated with a good 

20 prognosis, patients with tumour eosinophilia and NS have an unfavourable pattern of 
survival (Enblad et al - supra; von Wasielewski et al - supra). This implicates a 
profoundly different role of eosinophils in HD compared to other tumours. In a 
previous study a significant association between S-ECP and NS histology was found 
(Molin D, Enblad G, Sundstrom C, Venge P, Glimelius B. The serum levels of 

25 eosinophil cationic protein (ECP) are related to the infiltration of eosinophils in the 
tumours of patients with Hodgkin's Disease. Accepted for publication in Leukemia 
and Lymphoma). 

It is consequently clear that genetic polymorphisms in the gene coding for ECP are 
common and that the wild type variant of the ECP gene is associated with the nodular 
30 sclerosis type of disease in patients with Hodgkin's disease. Carriers of the wild type 
variant may therefore have a poorer prognosis. The ECP molecule takes part in 
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fibrotic processes in vivo and in keeping with previous notions of a relationship 
between fibroblast and eosinophil activities. 

Example 3: Recombinant ECPRNase activity assay 

5 Material and Methods 

Recombinant ECP was produced in the Baculovirus system. One recombinant product 
was the wild type ECP and the other the ECP mutation 97. The RNase activity of the 
ECP variants was tested by a commercial kit (RnaseAlert Lab Test Kit, described 
above) according to the instructions of the manufacturer (Ambion Inc. Texas, USA). 
1 0 The preparations were diluted in RNase buffer to a concentration of 1 0 ng/ml of ECP 
as measured by a specific radio immunoassay (Pharmacia Diagnostics, Uppsala, 
Sweden). Baculovirus supematants with no expressed ECP were used as controls. 

The control preparations contained endogenous RNase activity. This activity was 
1 5 subtracted from the activity of the preparations containing recombinant ECP. 

Results 

The results are shown in Figure 3. Wild type recombinant ECP has significant RNase 
activity, whereas the ECP mutation 97 has no activity. In contrast ECP mutation 97 

20 inhibits the endogenous activity of the protein-rich and RNase containing supernatant. 
The results are means of 6 experiments with very similar kinetics. The results shown 
in Figure 3 confirm that ECP mutation 97 may be used as an effective antagonist of 
wild-type ECP in the prevention and/or treatment of cancers and cancer-related 
disorders, in particular, eosinophil-related cancers. The assay of Example 3 may be 

25 used to identify other such ECP antagonists. 

Example 4: Relation of ECP polymorphism to Hodgkin's disease (disease stage 
and survival) 

30 Patients and clinical characteristics 

Altogether, 125 patients with diagnosed Hodgkin lymphoma were included in the 
study. Fifty-five patients were below the age of 40 and 70 were above. The patients 
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were classified as to type of disease based on histology (i.e. non nodular-sclerosis or 
nodular sclerosis type) and to disease stage (i.e. as to the spread of the disease with 
stage I only affecting one lymph node and the most widespread stage i.e. stage IV, 
affecting several lymph nodes and other organs in the body). Blood was taken from 
5 the patients for the measurements of Blood Hemoglobin (B-Hb) and Erythrocyte 
sedimentation rate (ESR). DNA was prepared for genotyping from lymph nodes 
biopsy material. 

Materials and methods 

10 Genotyping was performed as described above i.e. after preparation of DNA from 
lymph node material. The ECP-polymorphism with respect to the 926 nucleotide was 
typed by cleavage of the PCR-product with the restriction enzyme Pstl and 
subsequent electrophoresis of the nucleotide sequences in an agarose gel. The G 
homozygous genotype was completely cleaved whereas the C homozygous genotype 

1 5 remained uncleaved. The GC heterozygote variant was partially cleaved. 

Statistics 

Non-parametric analysis was used to compare groups. The Chi2 test was used to 
compare proportions. 

20 

Results 

Of the 125 patients with Hodgkin lymphoma, 79 had the nodular sclerosing type and 
46 had the non-nodular sclerosing type of disease. The ECP GG genotype has been 
linked to nodular sclerosis (p=0.03), and to progress of the disease. 73 patients had 
25 stages I-II and 51 had stages Ilb-IV. Thus ? the ECP polymorphism is related to higher 
stages of the disease with the G allele being found to be particularly common in ID3- 
IV stages of the disease (p=0.056). 

71 patients (56%) were found to be homozygous for the G allele, 38 patients (30%) 
30 were found to be heterozygous for the G and C alleles and 17 patients (13%) were 
found to be homozygous for the C allele. 
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Overall 22 patients died during the observation period. The survival time from 
diagnosis to death was significantly shorter in patients with the G allele homozygotes 
(619 days) as compared to heterozygotes and C allele homozygotes (1392 days) 
(p=0.02). This confirms that presence of the G allele is related to poor survival. 

Overall there was a relationship between B-Hb and ESR and the ECP polymorphism 
i.e. B-Hb was significantly lower (p=0.08) and ESR was significantly higher 
(p=0.009) in G allele homozygotes as opposed to heterozygotes and C allele 
homozygotes (p=0.02). This shows a relationship between wild type ECP and 
secondary prognostic signs. 

In patients below the age of 40 there was a significant relationship between the ECP 
polymorphism and progress of the disease. Thus, those with two copies of the G allele 
(GG) had higher stages than those having a single (CG) or no copies (CC) of this 
allele (p=0.005). Thus, the relationship between the ECP-polymorphism and 
prognosis in Hodgkin lymphoma seems to be age related and most obvious in patients 
below 40 years of age. Having the GG genotype in Hodgkin Lymphoma is therefore 
unfavorable as to prognosis. 
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CLAIMS 

1 . Use of a protein selected from; 
5 (a) an ECP, 

(b) a mutant of said ECP, and 

(c) a fragment of (a) or (b), 

10 

said protein lacking an arginine 97 of wild-type ECP or its equivalent, in the 
preparation of a medicament for the prevention and/or treatment of cancer or cancer- 
related disorders. 

15 2. Use of a DNA molecule encoding a protein selected from; 

(a) an ECP, 

(b) a mutant of said ECP, and 

20 

(c) a fragment of (a) or (b), 

said protein lacking an arginine 97 of wild-type ECP or its equivalent, or of a protein 
encoded thereby, in the preparation of a medicament for the prevention and/or 
25 treatment of cancer or cancer-related disorders. 

3. Use according to claim 1 or 2 wherein arginine 97, or its equivalent, is 
replaced by threonine or a bioisostere thereof. 

30 4. Use of an Eosinophil cationic protein (ECP), or a fragment thereof, comprising 
a modification at amino acid residue 97 or its equivalent such that residue 97 or its 
equivalent is any amino acid other than arginine; in the preparation of a medicament 
for the prevention and/or treatment of cancer or cancer-related disorders. 
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5. Use of a protein having the amino acid sequence as shown in SEQ LD. No. 1, 
or a fragment thereof, wherein residue 97 is any amino acid other than arginine; in the 
preparation of a medicament for the prevention and/or treatment of cancer or cancer- 

5 related disorders. 

6. Use according to claims 4 or 5 wherein residue 97 or its equivalent is 
threonine or a bioisostere thereof. 

10 7. Use according to any of claims 1-6, wherein the protein includes the sequence 
C T Y corresponding to amino acid residues 96-98 of wild-type ECP, or a biostere 
thereof. 

8. Use according to claim 7, wherein the biostere is of formula CXi Y, wherein 
15 Xi is any amino acid residue other than arginine. 

9. Use of a protein having SEQ ID No. 4 in the preparation of a medicament for 
the prevention and/or treatment of cancer or cancer-related disorders. 

20 10. Use according to any of claims 1-9 wherein the cancer is an eosinophil-related 
cancer. 

11. Use according to any of claims 1-10 wherein the eosinophil-related cancer is 
Hodgkin ? s Disease. 

25 

12. A method of detecting a predisposition to cancer or a cancer-related disorder or 
an increased likelihood of having such a disorder comprising determining in a 
biological sample, the absence of a protein selected from: 

30 (a) an ECP, 

(b) a mutant of said ECP, and 
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(c) a fragment of (a) or (b), 

said protein lacking an arginine 97 of wild-type ECP or its equivalent, or a nucleotide 
sequence encoding such a protein. 

5 

13. A method according to claim 10 wherein arginine 97, or its equivalent, is replaced 
by threonine or a bioisostere thereof. 

14. A method according to claim 12 or 13, using an antibody capable of 
10 recognising a said protein. 

15. A method according to claim 12 or 13, comprising detecting the presence of a 
G to C polymorphism at position 926 on an ECP encoding polynucleotide. 

15 16. A method according to claim 15, wherein the polymorphism is detected using 
PCR primers which amplify a DNA segment comprising a nucleotide at position 926 
on an ECP encoding polynucleotide. 

17. A method according to claim 16, wherein the PCR primers are SEQ ID NO: 5 
20 and SEQ ID NO: 6. 

18. A method according to claim 15 comprising making a Pst 1 digest of a sample 
believed to contain said WT-ECP DNA or nucleotide sequence and detecting the 
fragments produced. 

19. A diagnostic kit comprising the pair of PCR primers of SEQ ID NO: 5 and 
SEQ ID NO: 6. 

20. A diagnostic kit comprising an antibody as defined in claim 14. 

30 

21 . A kit comprising a diagnostic kit according to claim 1 9 or 20, and a therapeutic 
agent. 
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20. A kit according to claim 21 , wherein the therapeutic agent is for the treatment 
of cancer or a cancer-related disorder. 

21 . A kit according to claim 20 wherein the cancer is an eosinophil-related cancer. 

5 

22. A kit according to claim 21 wherein the eosinophil-related cancer is Hodgkin's 
Disease. 
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Pstl Digestion of PCR fragment from different 
13 individuals. 

Each fragmct contain the protein coding part of the 
Eosinophil Cationic Protein 
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Fig. 2 



ESR according to the arg97thr ECP genotype 
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Fig . 3 



RNase activity of recombinant ECP 



2000 




-2000 , r— r- , 

0 10 20 30 40 

Minutes 



3/3 



WO 02/089829 



PCT/IB02/02555 



SEQUENCE LISTING 

<110> Innoventus Project AB 
<120> SEQUENCES 

5 

<130> P011491WO NJN CAD 

<140> 
<141> 

10 

<160> 6 

<170> Patentln Ver. 2.1 

15 <210> 1 

<211> 1452 

<212> DNA 

<213> Homo sapiens 

20 <400> i 

ctgccagcag cgtatagttt tcacccagag 
cacaggatga cttggaatta gagtccgtat 
tatccgttgc tcagccaagt catcaaataa 
caatctgtgg gtttctgcat ggccagaccc 

25 agtaggggag accagctgcc cctgaacccc 
agccacagct cagagactgg gtaagtcaac 
gacagggcag cacctgaggg agaggtgagc 
tttggggaca ggaagaaaag gaaatgcgac 
tgagacacta tagagtgtgt cataaccgag 

30 tttcttacag gaaacatggt tccaaaactg 
ttggggctta tgggtgtgga gggctcactc 
cagtggtttg ccatccagca catcagtctg 
gcaattaaca attatcgatg gcgttgcaaa 
gctaatgtag ttaatgtttg tggtaaccaa 

35 aacaattgtc atcggagtag attccgggtg 
ggtgcacaga atatttcaaa ctgcaggtat 
gttgcatgtg acaacagaga tccacgggat 
ctggatacca ccatctaagc tcctgtatca 
tcctcaatca tagccaagat cccatccctc 

40 atcagtctcc ataccccttc agctttcctg 
actgctttgc aaattcatct ggaagtgtct 
ttagtgacaa tctgctctag agatttgggt 
accaaattcc ttgattcccc catcatcctt 
aaaaaaaatc caattgagtc aacttagcat 

45 gcaattttcc at 



tccagatccc accggcaaaa ctctgtctaa 60 
agcagaaaga gcagcagggc tgtccttggg 120 
aaaggatgat tgcacaagtg gaccatgtgt 180 
accaagggaa gctttattta aacagttcca 240 
agaacaacca gctggatcag ttctcacagg 300 
aatccccaga gctgggacag gaggggcagc 360 
tgaagttagt gcttaggaga tgtggcacac 420 
cccagagtgg cagcagaggg gcctgtgggt 4 80 
accggatagg ggagtagtta cttctcttct 540 
ttcacttccc aaatttgtct gcttcttctg 600 
catgccagac ccccacagtt tacgagggct 660 
aacccccctc gatgcaccat tgcaatgcgg 720 
aaccaaaata cttttcttcg tacaactttt 780 
agtatacgct gccctcataa cagaactctc 840 
cctttactcc actgtgacct cataaatcca 900 
gcagacagac caggaaggag gttctatgta 960 
tctccacggt atcctgtggt tccagttcac 1020 
gcagtcctca tcatcactca tctgccaagc 1080 
catgtactct gggtatcagc aactgtcctc 1140 
agctgaagtc ccttgtgaac cctgcaataa 1200 
gtgtgtcttc ctcggccgct ctgctgtcat 1260 
ttatcatgaa tctctccccc tcaatatctg 1320 
catgtgatac ctgattccag gcctgcctta 1380 
tggtctccct agccttaata tctcctctaa 1440 

1452 



<210> 2 

<211> 133 

50 <212> PRT 

<213> Homo sapiens 

<400> 2 

Arg Pro Pro Gin Phe Thr Arg Ala Gin Trp Phe Ala lie Gin His lie 
55 1 5 10 15 

Ser Leu Asn Pro Pro Arg Cys Thr lie Ala Met Arg Ala He Asn Asn 

20 25 30 

60 Tyr Arg Trp Arg Cys Lys Asn Gin Asn Thr Phe Leu Arg Thr Thr Phe 

35 4 0 4 5 



Ala Asn Val Val Asn Val Cys Gly Asn Gin Ser He Arg Cys Pro His 
50 55 60 
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Asn Arg Thr Leu Asn Asn Cys His Arg Ser Arg Phe Arg Val Pro Leu 
65 70 75 80 

Leu His Cys Asp Leu lie Asn Pro Gly Ala Gin Asn lie Ser Asn Cys 

85 90 95 

Arg Tyr Ala Asp Arg Pro Gly Arg Arg Phe Tyr Val Val Ala Cys Asp 

100 105 110 

Asn Arg Asp Pro Arg Asp Ser Pro Arg Tyr Pro Val Val Pro Val His 
115 120 125 

Leu Asp Thr Thr lie 
130 



<210> 3 
<211> 1452 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human derived 
nucleotide sequence 

<400> 3 

ctgccagcag cgtatagttt tcacccagag tccagatccc accggcaaaa ctctgtctaa 60 
cacaggatga cttggaatta gagtccgtat agcagaaaga gcagcagggc tgtccttggg 120 
tatccgttgc tcagccaagt catcaaataa aaaggatgat tgcacaagtg gaccatgtgt 180 
caatctgtgg gtttctgcat ggccagaccc accaagggaa gctttattta aacagttcca 240 
agtaggggag accagctgcc cctgaacccc agaacaacca gctggatcag ttctcacagg 300 
agccacagct cagagactgg gtaagtcaac aatccccaga gctgggacag gaggggcagc 360 
gacagggcag cacctgaggg agaggtgagc tgaagttagt gcttaggaga tgtggcacac 420 
tttggggaca ggaagaaaag gaaatgcgac cccagagtgg cagcagaggg gcctgtgggt 480 
tgagacacta tagagtgtgt cataaccgag accggatagg ggagtagtta cttctcttct 540 
tttcttacag gaaacatggt tccaaaactg ttcacttccc aaatttgtct gcttcttctg 600 
ttggggctta tgggtgtgga gggctcactc catgccagac ccccacagtt tacgagggct 660 
cagtggtttg ccatccagca catcagtctg aacccccctc gatgcaccat tgcaatgcgg 720 
gcaattaaca attatcgatg gcgttgcaaa aaccaaaata cttttcttcg tacaactttt 780 
gctaatgtag ttaatgtttg tggtaaccaa agtatacgct gccctcataa cagaactctc 840 
aacaattgtc atcggagtag attccgggtg cctttactcc actgtgacct cataaatcca 900 
ggtgcacaga atatttcaaa ctgcacgtat gcagacagac caggaaggag gttctatgta 960 
gttgcatgtg acaacagaga tccacgggat tctccacggt atcctgtggt tccagttcac 1020 
ctggatacca ccatctaagc tcctgtatca gcagtcctca tcatcactca tctgccaagc 1080 
tcctcaatca tagccaagat cccatccctc catgtactct gggtatcagc aactgtcctc 1140 
atcagtctcc ataccccttc agctttcctg agctgaagtc ccttgtgaac cctgcaataa 1200 
actgctttgc aaattcatct ggaagtgtct gtgtgtcttc ctcggccgct ctgctgtcat 1260 
ttagtgacaa tctgctctag agatttgggt ttatcatgaa tctctccccc tcaatatctg 1320 
accaaattcc ttgattcccc catcatcctt catgtgatac ctgattccag gcctgcctta 1380 
aaaaaaaatc caattgagtc aacttagcat tggtctccct agccttaata tctcctctaa 1440 
gcaattttcc at 14 52 



<210> 4 
<211> 133 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: human derived 
polypeptide sequence 

<400> 4 

Arg Pro Pro Gin Phe Thr Arg Ala Gin Trp Phe Ala lie Gin His lie 
15 10 15 
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Ser Leu Asn Pro Pro Arg Cys Thr lie Ala Met Arg Ala lie Asn Asn 

20 25 30 

Tyr Arg Trp Arg Cys Lys Asn Gin Asn Thr Phe Leu Arg Thr Thr Phe 

35 40 45 

Ala Asn Val Val Asn Val Cys Gly Asn Gin Ser lie Arg Cys Pro His 
50 55 60 

Asn Arg Thr Leu Asn Asn Cys His Arg Ser Arg Phe Arg Val Pro Leu 
65 70 75 80 

Leu His Cys Asp Leu lie Asn Pro Gly Ala Gin Asn lie Ser Asn Cys 

85 90 95 

Thr Tyr Ala Asp Arg Pro Gly Arg Arg Phe Tyr Val Val Ala Cys Asp 

100 105 110 

Asn Arg Asp Pro Arg Asp Ser Pro Arg Tyr Pro Val Val Pro Val His 
115 120 125 

Leu Asp Thr Thr lie 
130 



<210> 5 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

ggacagttgc tgatacccag agtac 25 



<210> 6 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

gtgtgtcata accgagaccg gatag 25 
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